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(Document name) 

(Title of the invention) 

pi , 3-GALACTOSYLTRANSFERASE AND DNA ENCODING THE 

SAME 

(Scope of the claims) 

(Claim 1) A protein having a pi, 3- 

galactosyltransf erase activity derived from a microorganism 
having an activity of transferring galactose to 
N-acetylglucosamine with pi , 3-linkage . 

(Claim 2) The protein according to claim 1, 
wherein the microorganism belongs to the genus 
Streptococcus . 

(Claim 3) The protein according to claim 2, 
wherein the microorganism is Streptococcus agalactiae . 

(Claim 4) A protein comprising the amino acid 
sequence represented by SEQ ID N0:1. 

(Claim 5) A protein comprising an amino acid 
sequence in which at least one amino acid is deleted, 
replaced, inserted or added in the amino acid sequence 
represented by SEQ ID N0:1, said protein having a 
pi , 3-galactosyltransf erase activity. 

(Claim 6) A DNA encoding the protein of any one of 
claims 1 to 5 . 

(Claim 7) A DNA comprising the nucleotide sequence 
represented by SEQ ID NO: 2. 
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(Claim 8) A DNA which hybridizes with a DNA 
comprising the complementary sequence to the nucleotide 
sequence represented by SEQ ID NO: 2 under stringent 
conditions , and encodes a protein having a 
(31 , 3-galactosyltransf erase activity . 

(Claim 9) A recombinant DNA comprising the DNA of 
any one of claims 6 to 8 and a vector. 

(Claim 10) A transformant obtained by introducing 
the recombinant DNA of claim 9 into a host cell. 

(Claim 11) The transformant according to claim 10, 
wherein the host cell is a microorganism. 

(Claim 12) The transformant according to claim 11, 
wherein the microorganism belongs to the genus Escherichia . 

(Claim 13) The transformant according to claim 12, 
wherein the microorganism belonging to the genus 
Escherichia is Escherichia coli. 

(Claim 14) A method for producing a protein having 
a pi , 3-galactosyltransf erase activity, comprising culturing 
the transformant of any one of claims 10 to 13 in a medium 
to produce and accumulate a protein having a pi,3- 
galactosyltransf erase activity in the culture, and 
recovering the protein from the culture. 

(Claim 15) A method for producing a galactose- 
containing carbohydrate, comprising selecting, as an enzyme 
source, a culture of the transformant of any one of claims 
10 to 13 or a treated product of the culture, allowing the 
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enzyme source, uridine-5 ' -diphosphogalactose and an 
acceptor carbohydrate to be present in an aqueous medium to 
produce and accumulate the galactose-containing 
carbohydrate in the aqueous medium, and recovering the 
galactose-containing carbohydrate from the aqueous medium. 

(Claim 16) The method according to claim 15, 
wherein the treated product of the culture is selected from 
the group consisting of a concentrated product of the 
culture, a dried product of the culture, cells obtained by 
centrifuging the culture, a dried product of the cells, a 
freeze-dried product of the cells, a surfactant-treated 
product of the cells, an ultrasonic-treated product of the 
cells, a mechanically disrupted product of the cells , a 
solvent-treated product- of the cells, an enzyme-treated 
product of the cells, a. protein fraction of the cells, an 
immobilized product of the cells and an enzyme preparation 
obtained by extracting from the cells. 

(Claim 17) The method according to claim 15, 
wherein the acceptor carbohydrate is a carbohydrate having 
N-acetylglucosamine at its non-reducing terminal. 

(Claim 18) The method according to claim 15, 
wherein the acceptor carbohydrate is selected from the 
group consisting of W-acetylglucosamine and lacto-AT-triose 
II. 

(Claim 19) The method according to claim 15, 
wherein the galactose-containing carbohydrate is selected 
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from the group consisting of lacto-iV-biose and lacto- 
N-tetraose . 

(Detailed description of the invention) 
(0001) 

(Technical filed to which the invention belongs) 

The present invention relates to a protein having a 
pi , 3-galactosyltransf erase activity, a DNA encoding the 
protein, a recombinant DNA containing the DNA, a 
transformant containing the recombinant DNA, a method for 
producing a protein having a pi , 3-galactosyltransf erase 
activity using the transformant, and a method for producing 
a galactose-containing carbohydrate using the transformant. 
(0002) 

(Background art) 

Regarding pi , 3-galactosyltransf erase genes, the 
genes derived from higher animal (J m Biol. Chem. , 273: 58 
(1998), J. Biol. Chem., 273: 12770 (1998), J. Biol. Chem. , 
274: 12499 (1999)) have been obtained. However, since it 
is generally difficult to express the genes derived from 
higher-animal as active proteins in microorganisms, a 
pi , 3-galactosyltransf erase gene derived from higher-animal 
has not been expressed as an active protein in a 
microorganism such as Escherichia coli or the like. 
(0003) 

On the other hand, in microorganisms, there is a 
report stating that a pi , 3-galactosyltransf erase gene was 
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obtained from Campylobacter jejuni and the gene was 
expressed in Escherichia coli. However, although this 
enzyme has an activity of transferring galactose to 
^-acetylgalactosamine, there is no report about the 
activity of transferring galactose to Af-acetylglucosamine 
(J. Biol. Chem., 274: 12499 (1999)). 
(0004) 

Human milk abundantly contains galactose-containing 
carbohydrates, lacto-JV-tetraose being one of the main 
components (Acta Paediatr., 82: 903 (1993), J. Pediatr. 
Gastroenterol. Nutr. , 30: 181 (2000)). Since it is known 
that lacto-N-tetraose and lacto-N-neotetraose, which are 
the galactose-containing carbohydrates, are recognized by 
Pseudomonas aeruginosa (Infect. Immun., 59: 700 (1991)), 
the galactose-containing carbohydrates are considered to be 
strong candidates for safe antiinf ection drugs which can 
prevent human body from infection with Pseudomonas 
aeruginosa . 
(0005) 

Regarding production of a galactose-containing 
carbohydrate such as lacto-N-tetraose or the like, both the 
methods of extraction from human milk and chemical 
synthesis are known but such methods have problems in terms 
of cost and productivity, so that its industrial production 
method has not yet been established. 
(0006) 
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(Problems to be solved by the invention) 

An object of the present invention is to provide a 
protein having a (31 , 3-galactosyltransf erase activity, a DNA 
encoding the protein, a method for producing a protein 
having a pi , 3-galactosyltransf erase activity using a 
transformant containing the DNA, and a method for producing 
a galactose-containing carbohydrate using the protein. 
(0007) 

(Means to solve the problems) 

In order to solve the above problems, the present 
inventors have conducted intensive studies and found a 
novel pi , 3-galactosyltransf erase, among enzymes concerning 
capsular polysaccharide biosynthesis in Streptococcus 
agalactiae , and have isolated the DNA encoding such enzyme, 
(0008) 

Specifically, the present invention relates to the 
following (1) to (19): 

(1) A protein having a pi , 3-galactosyltransf erase 
activity derived from a microorganism having an activity of 
transferring galactose to N-acetylglucosamine with 
pi , 3-linkage . 

(2) The protein according to (1), wherein the 
microorganism belongs to the genus Streptococcus. 

(3) The protein according to (2), wherein the 
microorganism is Streptococcus agalactiae . 
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(4) A protein comprising the amino acid sequence 
represented by SEQ ID N0:1. 

(5) A protein comprising an amino acid sequence in 
which at least one amino acid is deleted, replaced, 
inserted or added in the amino acid sequence represented by 
SEQ ID N0:1, said protein having a 
(Jl , 3-galactosyltransf erase activity. 

(6) A DNA encoding the protein of any one of (1) to (5). 

(7) A DNA comprising the nucleotide sequence 
represented by SEQ ID NO: 2. 

(8) A DNA which hybridizes with a DNA comprising the 
complementary sequence to the nucleotide sequence 
represented by SEQ ID NO:2 under stringent conditions, and 
encodes a protein having a pi , 3-galactosyltransf erase 
activity. 

(9) A recombinant DNA comprising the DNA of any one of 
(6) to (8) and a vector. 

(10) A transformant obtained by introducing the 
recombinant DNA of (9) into a host cell. 

(11) The transformant according to (10), wherein the 
host cell is a microorganism. 

( 12 ) The transformant according to ( 11 ) , wherein the 
microorganism belongs to the genus Escherichia . 

(13) The transformant according to (12), wherein the 
microorganism belonging to the genus Escherichia is 
Escherichia coli. 
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(14) A method for producing a protein having a 
(31 , 3-galactosyltransf erase activity, comprising culturing 
the transformant of any one of (10) to (13) in a medium to 
produce and accumulate a protein having a |3l,3- 
galactosyltransf erase activity in the culture , and 
recovering the protein from the culture. 

(15) A method for producing a galactose-containing 
carbohydrate, comprising selecting, as an enzyme source, a 
culture of the transformant of any one of (10) to (13) or a 
treated product of the culture, allowing the enzyme source, 
uridine-5 ' -diphosphogalactose and an acceptor carbohydrate 
to be present in an aqueous medium to produce and 
accumulate the galactose-containing carbohydrate in the 
aqueous medium, and recovering the galactose-containing 
carbohydrate from the aqueous medium. 

(16) The method according to (15)., wherein the treated 
product of the culture is selected from the group 
consisting of a concentrated product of the culture, a 
dried product of the culture, cells obtained by 
centrifuging the culture, a dried product of the cells, a 
freeze-dried product of the cells, a surfactant-treated 
product of the cells, an ultrasonic-treated product of the 
cells, a mechanically disrupted product of the cells, a 
solvent-treated product of the cells, an enzyme-treated 
product of the cells, a protein fraction of the cells, an 
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immobilized product of the cells and an enzyme preparation 
obtained by extracting from the cells. 

(17) The method according to (15), wherein the acceptor 
carbohydrate is a carbohydrate having W-acetylglucosamine 
at its non-reducing terminal. 

(18) The method according to (15), wherein the acceptor 
carbohydrate is selected from the group consisting of 
W-acetylglucosamine and lacto-W-triose II. 

(19) The method according to (15), wherein the 
galactose-containing carbohydrate is selected from the 
group consisting of lacto-AT-biose and lacto-N-tetraose. 
(0009) 

(Embodiments for carrying out the invention) 

The protein having a pi , 3-galactosyltransf erase 
activity of the present invention is a protein having a 
pi , 3-galactosyltransf erase activity derived from a 
microorganism which uses, as a substrate, an acceptor 
carbohydrate having W-acetylglucosamine ( hereinafter 
referred to as "GlcNAc") on its non-reducing terminal. For 
example, preferred is a protein having a 
pi , 3-galactosyltransf erase activity derived from a 
microorganism belonging to the genus Streptococcus , and 
more preferred is a protein having a 

pi , 3-galactosyltransf erase activity derived from 

Streptococcus agalactiae . 
(0010) 
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Specifically, the protein of the present invention 
includes a protein comprising the amino acid sequence 
represented by SEQ ID N0:1, and a protein comprising an 
amino acid sequence in which at most 2 0 amino acids are 
deleted, replaced, inserted or added in the amino acid 
sequence represented by SEQ ID NO:l and having a 
pi , 3-galactosyltransf erase activity. 
(0011) 

The modified protein can readily be obtained using 
a method for introducing site-directed mutation(s) 
described in, for example, Molecular Cloning, A Laboratory 
Manual, 2nd ed., Cold Spring Harbor Laboratory Press (1989) 
(hereinafter referred to as "Molecular Cloning, 2nd ed."), 
Current Protocols in Molecular Biology, John Wiley & Sons 
(1987-1997) (hereinafter referred to as "Current Protocols 
in Molecular Biology"), Nuc. Acids. Res., 10: 6487 (1982), 
Proc. Natl. Acad. Sci. USA, 79: 6409 (1982), Gene, 34: 315 
(1985), Nuc. Acids. Res., 13: 4431 (1985), Proc. Natl . Acad. 
Sci. USA, 82: 488 (1985) and the like. For example, the 
protein can be obtained by introducing mutation(s) to DNA 
encoding a protein comprising the amino acid sequence 
represented by SEQ ID NO: 2. 
(0012) 

The number of the amino acids which are deleted, 
replaced, inserted or added is not particularly limited; 



- 10 - 



however, it is usually 1 to 20, preferably 1 to 10, and 
more preferably 1 to 5 , amino acids . 

Also, in order to have the 

pi , 3-galactosyltransf erase activity of the protein of the 
present invention, it has preferably at least 50% or more, 
preferably 60% or more, still more preferably 80% or more, 
most preferably 95% or more, of identity to the amino acid 
sequence represented by SEQ ID NO:l. 
(0013) 

The DNA of the present invention includes a DNA 
encoding the protein of the present invention. 

Specific examples include a DNA encoding a protein 
comprising the amino acid sequence represented by SEQ ID 
NO:l, a DNA comprising the nucleotide sequence represented 
by SEQ ID NO: 2, and a DNA which hybridizes with a DNA 
comprising the complementary sequence to the nucleotide 
sequence represented by SEQ ID NO: 2 under stringent 
conditions and encodes a protein having a 
pi , 3-galactosyltransf erase activity . 
(0014) 

The DNA which hybridizes under stringent conditions 
is a DNA obtained by colony hybridization, plaque 
hybridization, Southern hybridization or the like using, as 
a probe, the DNA comprising the complementary sequence to 
the nucleotide sequence represented by SEQ ID NO: 2. 
Specific examples include a DNA which can be identified by 
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carrying out hybridization at 65°C in the presence of 0.7- 
1.0 M NaCl using a filter on which a DNA prepared from 
colonies or plaques is immobilized, and then washing the 
filter with O.lx to 2x SSC solution (the composition of lx 
SSC contains 150 mM sodium chloride and 15 mM sodium 
citrate) at 65°C. 
(0015) 

The hybridization can be carried out in accordance 
with a known method described in, for example, Molecular 
Cloning, 2nd ed., Current Protocols in Molecular Biology, 
DNA Cloning 1: Core Techniques, A Practical Approach, 2nd 
ed., Oxford University (1995) or the like. Specific 
examples of the DNA which can be hybridized include a DNA 
having an identity of 60% or more, preferably 80% or more, 
and more preferably 95% or more, with the complementary 
sequence to the nucleotide sequence represented by SEQ ID 
NO: 2 when calculated using above BLAST or the like. 
(0016) 

The transformant which produces the protein of the 
present invention having a pi , 3-galactosyltransf erase 
activity can be obtained, e.g., by preparing a recombinant 
DNA through ligation of the DNA of the present invention to 
a vector DNA in accordance with the method described in 
Molecular Cloning, 2nd ed. , and then transforming a host 
cell with the recombinant DNA in accordance with the method 
described in Molecular Cloning, 2nd ed. 
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(0017) 

The present invention is explained below in more 

detail . 

(1) Preparation of the DNA of the present invention 

The DNA of the present invention is desirably 
prepared from a microorganism belonging to the genus 
Streptococcus. Examples of the microorganism belonging to 
the genus Streptococcus include Streptococcus agalactiae , 
such as Streptococcus agalactiae Type lb and the like. 
(0018) 

The microorganism belonging to the genus 
Streptococcus is cultured by a known method (for example, 
the method described in J. Bacterid . , 181 : 5176 (1999)). 

After culturing, chromosomal DNA of the 
microorganism is isolated and purified by a known method 
(for example, method described in Current Protocols in 
Molecular Biology) . 

A fragment containing the DNA of the present 
invention can be obtained by a hybridization method, PCR or 
the like using a synthetic DNA designed based on a 
nucleotide sequence among the capsular polysaccharide 
biosynthesis genes of Streptococcus agalactiae Type III or 
Type la. 
(0019) 

The vector to which the DNA is ligated may be any 
vector, such as a phage vector, a plasmid vector or the 



- 13 - 



like, so long as it can replicate autonomously in 
Escherichia coli K12. Specific examples include ZAP 
Express (manufactured by Stratagene, Strategies , 5: 58 
(1992)), pBluescript II SK(+) (manufactured by Stratagene, 
Nucleic Acids Res., 17: 9494 (1989)), Xzap II (manufactured 
by Stratagene), X,gtlO and Xgtll (DNA Cloning, A Practical 
Approach, 1: 49 (1985)), XTriplEx (manufactured by 
Clontech), XExCell (manufactured by Amersham Pharmacia 
Biotech), pUC18 (Gene, 33: 103 (1985)) and the like, 
(0020) 

Any microorganism belonging to Escherichia coli can 
be used for the host of the recombinant DNA obtained by 
ligating the DNA of the present invention to the above 
vector, so long as it is a microorganism belonging to 
Escherichia coli. Specific examples include Escherichia 
coli XLl-Blue MRF ' (manufactured by Stratagene, Strategies , 
5: 81 (1992)), Escherichia coli C600 (Genetics, 39: 440 
(1954)), Escherichia coli Y1088 (Science, 222: 778 (1983)), 
Escherichia coli Y1090 (Science, 222: 778 (1983)), 
Escherichia coli NM522 (J. Mol. Biol., 166: 1 (1983)), 
Escherichia coli K802 (J. Mol. Biol., 16: 118 (1966)), 
Escherichia coli JM105 (Gene, 38: 275 (1985)) and the like. 
(0021) 

Any method can be used in the method for 
introducing the recombinant DNA, so long as it is a method 
for introducing DNA into the selected host cell. Examples 
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include a method using a calcium ion (Proc. Natl. Acad. Sci. 
USA, 69: 2110 (1972)), a protoplast (Japanese Published 
Unexamined Patent Application No. 248394/88), an 
electroporation (Nucleic Acid Res., 16: 6127 (1988)) and 
the like. 
(0022) 

The nucleotide sequence of the DNA of the present 
invention contained in the recombinant DNA can be 
determined by extracting the recombinant DNA from the thus 
obtained transf ormant . For the determination of the 
nucleotide sequence, a conventional method, such as the 
dideoxy method (Proc. Natl. Acad. Sci. USA, 74: 54 63 
(1977)) or an apparatus for nucleotide sequence analysis, 
such as DNA Sequencer 373A (manufactured by Perkin-Elmer ) 
or the like, can be used. 
(0023) 

The DNA of interest can also be prepared by 
chemical synthesis based on the thus determined nucleotide 
sequence using, for example, DNA Synthesizer 8905 Type 
manufactured by Perceptive Biosystems or the like. 

Examples of transformant containing the thus 
obtained recombinant DNA include Escherichia coli 
JM109/pBBPJ containing a plasmid DNA having the nucleotide 
sequence represented by SEQ ID NO: 2. 

(2) Preparation of the protein of the present invention. 
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The protein of the present invention can be 
produced by expressing the DNA of the present invention 
obtained by the method of (1) in a host cell, for example, 
as shown below, using a method described in Molecular 
Cloning, 2nd ed., Current Protocols in Molecular Biology or 
the like. 
(0024) 

When the DNA of the present invention is used, a 
DNA fragment of a suitable length containing a portion 
which encodes the protein of the present invention can be 
prepared, if necessary. In addition, productivity of the 
protein can be improved by substituting a nucleotide of the 
protein-coding portion of the nucleotide sequence so that 
it has the most suitable codons for the expression in the 
host . 

The transformant which expresses the DNA of the 
present invention can be obtained by inserting the DNA into 
a downstream of the promoter of a suitable expression 
vector to thereby prepare a recombinant DNA, and 
introducing the recombinant DNA into a host cell suitable 
for the expression vector. 
(0025) 

Any bacteria, yeasts, animal cells, insect cells, 
plant cells, and the like can be used as the host cell so 
long as it can express the gene of interest. 
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Examples of the expression vector include those 
which can replicate autonomously in the above-described 
host cell or can be integrated into chromosome and have a 
promoter at such a position that the DNA of the present 
invention can be transcribed. 
(0026) 

When a procaryote cell, such as a bacterium or the 
like, is used as the host cell, it is preferred that the 
recombinant DNA containing the DNA of the present invention 
can replicate autonomously in the bacterium. It is also 
preferred that the recombinant vector contains a promoter, 
a ribosome binding sequence, the DNA of the present 
invention and a transcription termination sequence. A gene 
regulating the promoter may also be desirably contained 
therewith in operable combination. 
(0027) 

Examples of the expression vector include pHelixl 
(manufactured by Roche Diagnostics), pKK223-3 (manufactured 
by Amersham Pharmacia Biotech), pSE280 (manufactured by 
Invitrogen), pGEMEX-1 (manufactured by Promega), pQE-8 
(manufactured by QIAGEN) , pET-3 (manufactured by Novagen), 
pKYPlO (Japanese Published Unexamined Patent Application No. 
110600/83), pKYP200 (Agric. Biol. Chem. , 48: 669 (1984)), 
pLSAl (Agric. Biol. Chem., 53: 277 (1989)), pGELl (Proc. 
Natl. Acad. Sci. USA, 82: 4306 (1985)), pBluescript II 
SK(+) (manufactured by Stratagene), pTrs30 (prepared from 
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Escherichia coli JMl09/pTrS30 (FERM BP-5407)), pTrs32 
(prepared from Escherichia coli JM109/pTrS32 (FERM BP- 
5408)), pPAC31 (WO 98/12343), pUC19 (Gene, 33: 103 (1985)), 
pSTV28 (manufactured by Takara Shuzo), pUC118 (manufactured 
by Takara Shuzo), pPAl (Japanese Published Unexamined 
Patent Application No. 233798/88), and the like. 
(0028) 

Any promoter can be used so long as it can function 
in the host cell. Examples include promoters derived from 
Escherichia coli, phage and the like, such as trp promoter 
(Ptrp), lac promoter (Plac), P L promoter, P R promoter, P SE 
promoter, etc., SPOl promoter, SP02 promoter, penP promoter 
and the like. Also, artificially designed and modified 
promoters, such as a promoter in which two Ptrp are linked 
in tandem (Ptrpx2), tac promoter, lacTl promoter letl 
promoter and the like, can be used. 
(0029) 

It is preferred to use a plasmid in which the space 
between Shine-Dalgarno sequence, which is the ribosome 
binding sequence, and the initiation codon is adjusted to 
an appropriate distance (for example, 6 to 18 base pairs). 

The transcription termination sequence is not 
always necessary for the expression of the DNA of the 
present invention. However, it is preferred to provide a 
transcription terminating sequence just downstream of the 
structural gene . 



- 18 - 



(0030) 

Examples of the procaryote cell include 
microorganisms belonging to the genera Escherichia , 
Serratia , Bacillus , Brevijbacteriu/n, Corynebacterium , 
Microbacterium r Pseudomonas , Streptococcus and the like. 
Specific examples include Escherichia coli XLl-Blue, 
Escherichia coli XL2-Blue, Escherichia coli DHl, 
Escherichia coli MC1000, Escherichia coli KY3276, 
Escherichia coli W1485, Escherichia coli JM109, Escherichia 
coli HB101, Escherichia coli No, 49, Escherichia coli W3110, 
Escherichia coli NY49, Serratia ficaria, Serratia fonticola, 
Serratia liquefaciens, Serratia marcescens f Bacillus 
subtilis , Bacillus megaterium, Bacillus amyloliquefaciens, 
Brevibacterium immariophilum ATCC 14068 ,>•• Brevibacterium 
saccharolyticum ATCC 14066, Corynebacterium ammoniagenes , 
Corynebacterium glutamicum ATCC 13032, Corynebacterium 
glutamicum ATCC 14067, Corynebacterium glutamicum ATCC 
13869, Corynebacterium acetoacidophilum ATCC 1387 0, 
Microbacterium ammoniaphilum ATCC 15354, Pseudomonas sp. D- 
0110, Streptococcus agalactiae Type la, Streptococcus 
agalactiae Type lb, Streptococcus agalactiae Type III, 
Streptococcus pneumoniae Type 14, and the like. 
(0031) 

With regard to the method for the introduction of 
the recombinant DNA, any method for introducing DNA into 
the above-described host cells, such as a method using a 
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calcium ion (Proc. Natl. Acad. Sci. USA, 69: 2110 (1972)), 
a protoplast (Japanese Published Unexamined Patent 
Application No. 248394/88), an electroporation (Nucleic 
Acids Res., 16: 6127 (1988)) and the like, can be used. 
(0032) 

When yeast is used as the host cell, examples of 
the expression vector include YEpl3 (ATCC 37115), YEp24 
(ATCC 37051), YCp50 (ATCC 37419), pHS19, pHS15, and the 
like. 

Any promoter can be used so long as it can function 
in yeast. Examples include PH05 promoter, PGK promoter, 
GAP promoter, ADH promoter, gal 1 promoter, gal 10 promoter, 
a heat shock polypeptide promoter, MFal promoter, CUP 1 
promoter and the like. 
(0033) 

Examples of the host cell include yeast strains 
belonging to the genera Saccharomyces , Schizosaccharomyces , 
Kluyveromyces , Trichosporon , Schwanniomyces , Pichia , 
Candida and the like. Specific examples include 

Saccharomyces cerevisiae , Schizosaccharomyces pombe, 
Kluyveromyces lactis, Trichosporon pullulans , 

Schwanniomyces alluvius , Pichia pastoris , Candida utilis 
and the like. 
(0034) 

With regard to the method for the introduction of 
the recombinant DNA, any method for introducing DNA into 
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yeast, such as an electroporation (Methods. Enzymol . , 
194: 182 (1990)), a spheroplast method (Proc. Natl. Acad. 
Sci. USA, 75: 1929 ( 1978)), a lithium acetate method (J. 
Bacteriol., 153: 163 (1983)) and the like, can be used. 
(0035) 

When an animal cell is used as the host cell, 
examples of the expression vector include pcDNAI and pcDM8 
(manufactured by Funakoshi), pAGE107 (Japanese Published 
Unexamined Patent Application No. 22979/91), pAS3-3 
(Japanese Published Unexamined Patent Application No. 
227075/90), pcDM8 (Nature, 329: 840 (1987)), pcDNAI/Amp 
(manufactured by Invitrogen), pREP4 (manufactured by 
Invitrogen), pAGEl03 (J. Biochem. , 101: 1307 (1987)), 
pAGE210, pAMo, pAMoA and the like. 
(0036) 

Any promoter can be used so long as it can function 
in an animal cell. Examples include a promoter of IE 
(immediate early) gene of cytomegalovirus (CMV), an early 
promoter of SV40, a metallothionein promoter, a promoter of 
retrovirus, a heat shock promoter, SRa promoter, and the 
like. Also, the enhancer of the IE gene of human CMV can 
be used together with the promoter. 
(0037) 

Examples of the host cell include mouse myeloma 
cell, rat myeloma cell, mouse hybridoma cell, human Namalwa 
cell, Namalwa KJM-1 cell, human fetal kidney cell, human 
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leukemia cell, African grivet kidney cell, Chinese hamster 
ovary (CHO) cell HST5637 (Japanese Published Unexamined 
Patent Application No. 299/88) and the like. 
(0038) 

Examples of the mouse myeloma cell include SP2/0, 
NSO and the like. Examples of the rat myeloma cell include 
YB2/0 and the like. Examples of the human fetal kidney 
cell include HEK293 (ATCC: CRL-1573) and the like. 
Examples of the human leukemia cell include BALL-l and the 
like. Examples of the African grivet kidney cell -include 
COS-1, COS-7 and the like. 

The method for introduction of the recombinant DNA 
into animal cells is not particularly limited, so long as 
it is the general method for introducing DNA into animal 
cells, such as an electroporation (Cytotechnology , 3: 133 
(1990)), a calcium phosphate method (Japanese Published 
Unexamined Patent Application No. 227075/90), a lipofection 
(Proc. Natl. Acad. Sci. USA, 84, 7413 ( 1987)), the method 
described in Virology, 52: 456 (1973) and the like. 
(0039) 

When an insect cell is used as the host cell, the 
protein can be expressed by a known method described in, 
for example, Baculovirus Expression Vectors, A Laboratory 
Manual, W.H. Freeman and Company, New York (1992), 
Molecular Biology, A Laboratory Manual, Current Protocols 
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in Molecular Biology, Bio/Technology, 6: 47 ( 1988) or the 

like. 

(0040) 

Specifically, a transfer vector containing the DNA 
to make it express and baculovirus are co-transf ected into 
an insect cell to obtain a recombinant virus in a 
supernatant of the culture of its insect cell, and then an 
insect cell is infected with the resulting recombinant 
virus to express the protein. 

Examples of the transfer vector used in the method 
include pVLl392, pVLl393 and pBlueBacIII (all manufactured 
by Invitrogen), and the like. 
(0041) 

Examples of the baculovirus include Autographa * 
calif ornica nuclear polyhedrosis virus which infects . 
insects of the family Barathra and the like. 

Examples of the insect cell include Spodoptera 
frugiperda ovary cell, Trichoplusia ni ovary cell, Bombyx 
mori ovary-derived culturing cell and the like. 
(0042) 

Examples of Spodoptera frugiperda ovary cells 
include Sf9 and Sf21 (Baculovirus Expression Vectors, A 
Laboratory Manual) and the like. Examples of Trichoplusia 
ni ovary cells include High 5 and BTI-TN-5B1-4 
(manufactured by Invitrogen) and the like. Examples of the 
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cell line derived from silkworm ovary cellinclude Bombyx 
mori N4 and the like. 

The methods for co-transf ecting the above transfer 
vector and the above baculovirus for the preparation of the 
recombinant virus include a calcium phosphate method 
(Japanese Published Unexamined Patent Application No, 
227075/90), a lipofection (Proc. Natl. Acad. Sci. USA, 
84: 7413 (1987)), and the like. 
(0043) 

When a plant cell is used as the host cell, 
examples of expression vector include Ti plasmid, a tobacco 
mosaic virus vector, and the like. 

Any promoter can be used so long as it can function 
in a plant cell. Examples include 35S promoter of 
cauliflower mosaic virus (CaMV), rice actin 1 promoter, and 
the like. 
(0044) 

Examples of the host cells include plant cells and 
the like, such as tobacco, potato, tomato, carrot, soybean, 
rape, alfalfa, rice, wheat, barley, and the like. 

The method for introducing the recombinant DNA is 
not particularly limited, so long as it is the general 
method for introducing DNA into a plant cell, such as the 
Agrobacterium method (Japanese Published Unexamined Patent 
Application No. 140885/84, Japanese Published Unexamined 
Patent Application No. 70080/85, WO 94/00977), the 
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electroporation (Japanese Published Unexamined Patent 
Application No, 251887/85), the particle gun method 
(Patents 2606856 and 2517813), and the like* 
(0045) 

The gene can be expressed as a secretory protein or 
a fusion protein and the like in accordance with the 
methods described in Molecular Cloning, 2nd ed., in 
addition to direct expression. 

When expressed in yeast, an animal cell or an 
insect cell, a glycosylated protein can be obtained. 
(0046) 

The protein of the present invention can be 
produced by culturing the thus obtained transformant of the 
present invention in a medium to produce and accumulate the 
protein in the culture, and recovering the protein from the 
culture. 

Culturing of the transformant of the present 
invention in a medium is carried out according to the 
conventional method as used in culturing of the host. 
(0047) 

As a medium for culturing the transformant obtained 
by using, as the host, prokaryote (such as Escherichia coli 
or the like) or eukaryote (such as yeast or the like), the 
medium may be either a natural medium or a synthetic medium, 
so long as it contains a carbon source, a nitrogen source, 
an inorganic salt and the like which can be assimilated by 
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the organism and the transf ormant can be cultured 
efficiently. 

Examples of the carbon source which can be 
assimilated by the transf ormant include carbohydrates (for 
example, glucose, fructose, sucrose, molasses containing 
them, starch, starch hydrolysate, etc.), organic acids (for 
example, acetic acid, propionic acid, etc.)/ alcohols (for 
example, ethanol, propanol, etc.), and the like. 
(0048) 

Examples of the nitrogen source include ammonia, 
various ammonium salts of inorganic acids or organic acids 
(for example, ammonium chloride, ammonium sulfate, ammonium 
acetate, ammonium phosphate, etc.), other nitrogen- 
containing compounds, peptone, meat extract, yeast extract, 
corn steep liquor, casein hydrolysate, soybean meal and 
soybean meal hydrolysate, various fermented cells and 
hydrolysates thereof, and the like. 
(0049) 

Examples of the inorganic salt include potassium 
dihydrogen phosphate, dipotassium hydrogen phosphate, 
magnesium phosphate, magnesium sulfate, sodium chloride, 
ferrous sulfate, manganese sulfate, copper sulfate, calcium 
carbonate, and the like. 

Culturing is usually carried out under aerobic 
conditions by shaking culture , submerged spinner culture 
under aeration or the like. The culturing temperature is 
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preferably from 15 to 4 0°C, and the culturing time is 
generally from 5 hours to 7 days. The pH of the medium is 
preferably maintained at 3.0 to 9.0 during culturing. The 
pH can be adjusted using an inorganic or organic acid, an 
alkali solution, urea, calcium carbonate, ammonia, or the 
like . 
(0050) 

Also, antibiotics such as ampicillin, tetracycline, 
and the like, can be added to the medium, during culturing, 
if necessary. 

When a microorganism transformed with an expression 
vector containing an inducible promoter is cultured, an 
inducer can be added to the medium, if necessary. For 
example, isopropyl-P-D-thiogalactopyranoside (IPTG) or the 
like can be added to the medium when a microorganism 
transformed with an expression vector containing lac 
promoter is cultured, or indoleacrylic acid or the like can 
by added thereto when a microorganism transformed with an 
expression vector containing trp promoter is cultured. 
(0051) 

Examples of the medium for culturing a trans formant 
obtained using an animal cell as the host include generally 
used RPMI 1640 medium (The Journal of the American Medical 
Association, 199 : 519 (1967)), Eagle's MEM (Science, 
122: 501 (1952)), DMEME (Virology, 8: 396 (1959)), and 199 
Medium (Proceeding of the Society for the Biological 
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Medicine, 73: 1 (1950)), as well as other media to which 
fetal calf serum or the like has been added to the above 
media and the like. 
(0052 ) 

Culturing is generally carried out under conditions 
at pH 6 to 8 and at 30 to 4 0°C for 1 to 7 days in the 
presence of 5% C0 2 or the like. 

Furthermore, if desired, antibiotics such as 
kanamycin, penicillin, streptomycin and the like, can be 
added to the medium during culturing. 

Examples of the medium for culturing a transformant 
obtained using an insect cell as the host include generally 
used TNM-FH medium (manufactured by Pharmingen), Sf-900 II 
SFM (manufactured by Life Technologies), ExCell 400 and 
ExCell 405 (both manufactured by JRH Biosciences ),. Grace ' s 
Insect Medium (Nature, 195: 788 (1962)) and the like. 
(0053) 

Culturing is generally carried out under conditions 
at pH 6 to 7 and at 25 to 30°C for 1 to 5 days or the like. 

Furthermore, if desired, antibiotics such as 
gentamicin and the like, can be added to the medium during 
culturing. 

A transformant obtained using a plant cell as the 
host cell can be used as the cell or after differentiating 
to a plant cell or organ. Examples of the medium used in 
culturing of the transformant include Murashige and Skoog 
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(MS) medium, White medium, media to which a plant hormone, 
such as auxin, cytokinine, or the like has been added, and 
the like. 
(0054) 

Culturing is carried out generally * at a pH 5 to 9 
and at 20 to 40°C for 3 to 60 days. 

Also, antibiotics such as kanamycin, hygromycin and 
the like, can be added to the medium during culturing, if 
necessary. 

As described above, the protein can be produced by 
culturing a transformant derived from a microorganism, 
animal cell or plant cell containing a recombinant DNA to 
which a DNA encoding the protein of the present invention 
has been inserted according to the general culturing method 
to produce and accumulate the protein, and recovering the 
protein from the culture. 
(0055) 

The method for producing the protein of the present 
invention includes a method of intracellular expression in 
a host cell, a method of extracellular secretion from a 
host cell, or a method of production on an outer membrane 
of the host cell. The method can be selected by changing 
the host cell employed or the structure of the protein 
produced. 

When the protein of the present invention is 
produced in a host cell or on an outer membrane of the host 
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cell, the protein can be actively secreted extracellular ly 
according to, for example, the method of Paulson et al. (J. 
Biol. Chem. , 264: 17619 (1989)), the method of Lowe et al. 
(Proc. Natl. Acad. Sci. USA, 86: 8227 (1989); Genes 
Develop. , 4: 1288 (1990)), or the methods described in 
Japanese Published Unexamined Patent Application Nos. 
336963/93, 823021/94, and the like. 
(0056) 

Specifically, the protein of the present invention 
can be actively secreted extracellularly by expressing it 
in the form that a signal peptide has been added to the 
side of N-terminal of a protein containing an active site 
of the protein of the present invention according to the 
recombinant DNA technique, 
(0057) 

Furthermore, the amount produced can be increased 
using a gene amplification system, such as by use of a 
dihydrof olate reductase gene or the like according to the 
method described in Japanese Published Unexamined Patent 
Application No. 227075/90. 

Moreover, the protein of the present invention can 
be produced by a transgenic animal (transgenic nonhuman 
animal) or plant (transgenic plant). 

When the transformant is the nonhuman animal 
individual or plant individual, the protein of the present 
invention can be produced by breeding or cultivating it so 
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as to produce and accumulate the protein, and recovering 
the protein from the nonhuman animal individual or plant 
individual . 
(0058) 

Examples of the method for producing the protein of 
the present invention using the nonhuman animal individual 
include a method for producing the protein of the present 
invention in a nonhuman animal developed by introducing a 
gene according to known methods (Am, J. Clin. Nutr., 
63: 639S (1996), Am. J. Clin. Nutr., 63z 627S (1996), 
Bio / Technology , 9: 830 (1991)). 

In the nonhuman animal individual, the protein can 
be produced by breeding a transgenic nonhuman animal to 
which the DNA encoding the protein of the present invention 
has been introduced to produce and accumulate the protein 
in the animal, and recovering the protein from the animal. 
Examples of the production and accumulation place in the 
animal include milk (Japanese Published Unexamined Patent 
Application No. 309192/88), egg and the like of the animal. 
Any promoter can be used, so long as it can function in the 
animal. Suitable examples include an a-casein promoter, a 
(5-casein promoter, a (3-lactoglobulin promoter, a whey 
acidic protein promoter, and the like, which are specific 
for mammary glandular cells. 
(0059) 
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Examples of the method for producing the protein of 
the present invention using the plant individual include a 
method for producing the protein of the present invention 
by cultivating a transgenic plant to which the DNA encoding 
the protein of the present invention is introduced by a 
known method (Tissue Culture, 20 (1994), Tissue Culture, 21 
(1994), Trends in Biotechnol., 15: 45 (1997)) to produce 
and accumulate the protein in the plant, and recovering the 
protein from the plant. 
(0060) 

The protein produced by the transformant of the 
present invention can be isolated and purified using the 
general method for isolating and purifying an enzyme. 

For example, when the protein of the present 
invention is expressed as a soluble product in the host 
cells, the cells are collected by centrif ugation after 
culturing, suspended in an aqueous buffer, and disrupted 
using an ultrasonicator , a French press, a Manton Gaulin 
homogenizer, a Dynomill, or the like to obtain a cell-free 
extract . 
(0061) 

From the supernatant obtained by centrifuging the 
cell-free extract, a purified product can be obtained by 
the general method used for isolating and purifying an 
enzyme, for example, solvent extraction, salting-out using 
ammonium sulfate or the like, desalting, precipitation 
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using an organic solvent, anion exchange chromatography 
using a resin, such as diethylaminoethyl ( DEAE ) -Sepharose , 
DIAION HPA-75 (manufactured by Mitsubishi Chemical) or the 
like, cation exchange chromatography using a resin, such as 
S-Sepharose FF (manufactured by Pharmacia) or the like, 
hydrophobic chromatography using a resin, such as butyl 
sepharose, phenyl sepharose or the like, gel filtration 
using a molecular sieve, affinity chromatography, 
chromatof ocusing, or electrophoresis, such as isoelectronic 
focusing or the like, alone or in combination thereof. 
(0062) 

When the protein is expressed as an inclusion body 
in the host cells, the cells are collected in the same 
manner, disrupted and centrifuged to recover the protein as 
the precipitate fraction. Next, the inclusion body of the 
protein is solubilized with a protein-denaturing agent. 

The solubilized protein solution is diluted or 
dialyzed to lower the concentration of the protein 
denaturing agent in the solution. Thus, the normal 
tertiary structure of the protein is reconstituted. After 
the procedure, a purified product of the protein can be 
obtained by a purification/isolation method similar to the 
above . 
(0063) 

When the protein of the present invention or its 
glycosylated-derivative is secreted out of cells, the 
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protein or its derivative can be collected in the culture 
supernatant . 

Namely , the culture supernatant is obtained by 
treating the culture medium in a treatment similar to the 
above, such as centrif ugation or the like. Then a purified 
product can be obtained from the supernatant using a 
purification/isolation method similar to the above* 
(0064) 

Examples of the protein obtained by the above 
method include a protein comprising the amino acid sequence 
represented by SEQ ID NO:l. 

Furthermore, a fusion protein of the protein of the 
present invention and other protein is produced, and can be 
purified using affinity chromatography using a substance 
having affinity to the fusion protein. For example, the 
protein of the present invention is produced as a fusion 
protein with protein A according to the method of Lowe et 
al. (Proc. Natl. Acad. Sci. USA, 86: 8227 (1989); Genes 
Develop., 4: 1288 (1990)), or the method described in 
Japanese Published Unexamined Patent Application No. 
336963/93 or 823021/94, and it can be purified by affinity 
chromatography using immunoglubulin G. 
(0065) 

Moreover, the protein of the present invention is 
produced as a fusion protein with Flag peptide, and the 
fusion protein can be purified by affinity chromatography 
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using an anti-Flag antibody (Proc. Natl. Acad. Sci., USA, 
86 i 8227 (1989)). Further purification can be carried out 
by affinity chromatography using the antibody against the 
protein per se. 
(0066) 

Also, based on the information of the thus obtained 
protein, the protein of the present invention can be 
produced by a chemical synthesis method, such as Fmoc 
( f luorenylmethyloxycarbonyl ) method, tBoc 

( t-butyloxycarbonyl ) method, or the like. It can also be 
chemically synthesized using a peptide synthesizer 
manufactured by Advanced ChemTech, Perkin-Elmer , Pharmacia, 
Protein Technology Instrument, Synthecell-Vega, PerSeptive, 
Shimadzu Corporation, or the like. 

(3) Preparation of galactose-containing carbohydrate 

A galactose-containing carbohydrate can be produced 
in an aqueous medium using a culture of the transformant 
obtained by culturing described in (2) or a treated product 
of the culture as the enzyme source. 
(0067) 

Examples of the treated product of culture include 
a concentrated product of the culture, a dried product of 
the culture, cells obtained by centrifuging the culture, a 
dried product of the cells, a freeze-dried product of the 
cells, a surfactant-treated product of the cells, an 
ultrasonic-treated product of the cells, a mechanically 
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disrupted product of the cells , a solvent-treated product 
of the cells, an enzyme-treated product of the cells, a 
protein fraction of the cells, an immobilized product of 
the cells, an enzyme preparation obtained by extracting 
from the cell, and the like. 
(0068) 

The enzyme source for use in the production of a 
galactose-containing carbohydrate is used in a 
concentration of 1 mU/1 to 1,000 U/l, preferably 10 mU/1 to 
100 U/l, when the activity capable of forming 1 |amol of 
galactose-containing carbohydrate at 37°C in 1 minute is 
defined as 1 unit (U) . 

Examples of the aqueous medium for use in the 
production of a galactose-containing carbohydrate include 
water, buffer solutions (for example, phosphate buffer, 
carbonate buffer, acetate buffer, borate buffer, citrate 
buffer, tris bufrer, etc.), alcohols (for example, methanol, 
ethanol, etc.), esters (for example, ethyl acetate, etc.), 
ketones (for example, acetone, etc.), amides (for example, 
acetamide, etc.), and the like. Also, the culture of the 
microorganisms used as the enzyme source can be used as an 
aqueous medium. 
(0069) 

In producing a galactose-containing carbohydrate, a 
surfactant or an organic solvent may be added, if necessary. 
Any surfactant capable of accelerating the formation of a 
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galactose-containing carbohydrate may be used as the 
surfactant. Examples include nonionic surfactants (for 
example, polyoxyethylene octadecylamine (e.g., Nymeen S-215, 
manufactured by Nippon Oil & Fats), etc.), cationic 
surfactants (for example, cetyltrimethylammonium bromide, 
alkyldimethyl benzylammoniumchloride (e.g., Cation F2-40E, 
manufactured by Nippon Oil & Fats), etc.), anionic 
surfactants (for example, lauroyl sarcosinate, etc.), 
tertiary amines (for example, alkyldimethylamine (e.g., 
Tertiary Amine FB, manufactured by Nippon Oil & Fats), 
etc.), and the like, which are used alone or as a mixture 
of two or more. The surfactant is used generally in a 
concentration of 0.1 to 50 g/1. Examples of the organic 
solvent include xylene, toluene, fatty acid alcohol, 
acetone, ethyl acetate, and the like, which are used in a 
concentration of generally 0.1 to 50 ml/1. 
(0070) 

The galactose-containing carbohydrate formation 
reaction is carried out in an aqueous medium having a pH 5 
to 10, preferably 6 to 8 , at 20 to 50°C for 1 to 96 hours. 
In this formation reaction, inorganic salts, such as MnCl 2 , 
MgCl 2 and the like, can be added, if necessary. 

The amount of the galactose-containing carbohydrate 
produced in the aqueous medium can be determined, for 
example, using a glycosyl analyzer manufactured by Dionex 
(Anal. Biochem. , 189 : 151 (1990)). 
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(0071) 

The galactose-containing carbohydrate produced in 
the aqueous medium can be collected by the ordinary methods 
using activated carbon, ion exchange resins, and the like. 

The present invention is explained based on the 
examples, but the scope of the present invention is not 
limited thereto, 
(0072) 
(Examples ) 

Example 1 Isolation of DNA containing (31,3- 

galactosyltransf erase gene : 

(1) Cloning of capsular polysaccharide biosynthesis genes 
from Streptococcus agalactiae Type la (1) 

Streptococcus agalactiae Type la was cultured by 
the method described in J. Bacterid., 181 : 5176 (1999). 
After collecting the cells by centrif ugation, chromosomal 
DNA of the microorganism was isolated and purified in 
accordance with the method described in Current Protocols 
in Molecular Biology. 
(0073) 

DNAs having the nucleotide sequences represented by 
SEQ ID NOs:4 and 5 which had been designed based on the 
nucleotide sequence around the cpsD gene which is one of 
the capsular polysaccharide biosynthesis genes of 
Streptococcus agalactiae Type III (Mol. Microbiol., 8: 843 
(1993)) were synthesized using a DNA synthesizer Model 8905 
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manufactured by Perceptive Biosystems . Using these 

synthetic DNAs as a primer set, PCR was carried out using 
the chromosomal DNA of Streptococcus agalactiae Type la as 
the template. The PCR was carried out using 40 \xl of a 
reaction solution containing 0.1 \xg of the chromosomal DNA, 
0.5 ^mol/1 each primer, 2.5 units of TaKaRa LA Taq 
polymerase (manufactured by Takara Shuzo) , 4 jxl of a buffer 
solution for TaKaRa LA Taq polymerase and 200 |nmol/l each 
deoxyNTP and repeating a reaction step consisting of 1 
minute at 94°C, 2 minutes at 42°C and 3 minutes at 72°C 30 
times. A probe was prepared by labeling the thus amplified 
fragment using Random Primer DNA Labeling Kit (manufactured 
by Takara Shuzo). 
(0074) 

Fragments obtained by digesting the chromosomal DNA 
of Streptococcus agalactiae Type la with a restriction 
enzyme EcoRT were ligated to pBluescript II SK(+) to 
prepare recombinant DNAs, and a library was prepared by 
transforming E . coli JM109 with these recombinant DNAs. 

Using the above probe and library, colony 
hybridization was carried out according to the method known 
by persons having ordinary skill in the art to obtain a 
strain of clone showing a strong signal. A plasmid 
contained in this strain was named pBAlOl and its structure 
was analyzed to find that it was a structure in which a 3.5 
kb fragment derived from the chromosomal DNA of 
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Streptococcus agalactiae Type la was inserted into 
pBluescript II SK(+), The nucleotide sequence of the DNA 
was determined according to the method known by persons 
having ordinary skill in the art, and as a result, the 
three genes named cpsIaF, cpslaG and cpsIaH, and a part of 
a gene named cpsIaE described in J. Bacterid., 181: 5176 
(1999) were found in the DNA, so that it was confirmed that 
the DNA contains a part of the capsular polysaccharide 
biosynthesis genes. Also, as a result of homology search, 
it was confirmed that the cpsIaE gene and the cpslaG gene 
had high homology with the glucosyltransf erase gene and 
with the pi , 4-galactosyltransf erase gene, respectively 
(Fig. 1). 

(2) Cloning of capsular polysaccharide biosynthesis gene 
from Streptococcus agalactiae Type la (2) 

The 3.5 kb fragment of Streptococcus agalactiae 
Type la, which had been inserted into the plasmid pBAlOl 
obtained in (1) of Example 1, was labeled using Random 
Primer DNA Labeling Kit (manufactured by Takara Shuzo) to 
prepare a probe. 
(0075) 

Fragments obtained by digesting the chromosomal DNA 
of Streptococcus agalactiae Type la with a restriction 
enzyme Bglll were ligated to pBluescript II SK(+) to 
prepare recombinant DNAs, and a library was prepared by 
transforming E. coli JM109 with these recombinant DNAs. 
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Using the above probe and library, colony 
hybridization was carried out so as to obtain a strain of 
clone showing a strong signal. A plasmid contained in this 
strain was named pBA103 and its structure was analyzed to 
find that it was a structure in which a 3.1 kb DNA derived 
from the chromosomal DNA of Streptococcus agalactiae Type 
la was inserted into pBluescript II SK(+). When the 
nucleotide sequence of the 3.1 kb DNA was determined 
according to the method known by persons having ordinary 
skill in the art, genes named cpslal and cpsIaJ and a part 
of the cpsIaH gene were found in the DNA, and it was 
confirmed that the DNA contained in pBAl03 was a DNA 
adjacent to the DNA contained in pBAlOl on the chromosomal 
DNA and that pBA103 is also a plasmid which contains a part 
of the DNA of the capsular polysaccharide biosynthesis 
genes. As a result of homology search, it was confirmed 
that the cpslal gene and the cpsIaJ gene have high homology 
with the pi , 3-N-acetylglucosaminyltransf erase gene and the 
(31 , 4-galactosyltransf erase gene, respectively (Fig. 1, J\ 
Bacterid., 181 : 5176 (1999)). 

(3) Isolation of DNA containing (31 , 3-galactosyltransf erase 
gene from Streptococcus agalactiae Type lb 

The 3.1 kb fragment of Streptococcus agalactiae 
Type la, which had been inserted into the plasmid pBA103 
obtained in (2) of Example 1, was labeled using Random 
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Primer DNA Labeling Kit (manufactured by Takara Shuzo) to 

prepare a probe ♦ 

(0076) 

Streptococcus agalactiae Type lb was cultured by 
the method described in J. Bacterid. , 181: 5176 (1999). 
After collecting the cells by centrif ugation , chromosomal 
DNA of the microorganism was isolated and purified in 
accordance with the method described in Current Protocols 
in Molecular Biology. 

Fragments obtained by digesting the chromosomal DNA 
of Streptococcus agalactiae Type lb with a restriction 
enzyme Bglll were introduced into pBluescript II SK(+) and 
transformed into E . coli JM109 to prepare a library. 
(0077) 

Using the above probe and library, colony 
hybridization was carried out to obtain two cloned strains 
showing a strong signal. Plasmids contained in these 
strains were named pBBl02 and pBB103, respectively, and 
their structures were analyzed to find that they were 
structures in which 5.5 and 1.4 kb fragments derived from 
the chromosomal DNA of Streptococcus agalactiae Type lb 
were inserted into pBluescript II SK(+), respectively 
(Fig. 1). 
(0078) 

When the nucleotide sequences of these two DNAs 
were determined according to the method known by persons 
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having ordinary skill in the art, it was determined that 
these two DNAs were present by adjacent to each other on 
the chromosomal DNA and having the continued nucleotide 
sequence represented by SEQ ID NO: 3 • It was revealed that 
genes named cpsIbE , cpsIbF , cpsIbG, cpsIbH, cpslbl and 
cpsIbJ are present in the DNA having the nucleotide 
sequence represented by SEQ ID NO: 3. As a result of 
homology search, the cpsIbE gene showed high homology with 
the cpsIaE gene and the glucosyltransf erase gene, the 
cpsIbG gene with the cpslaG gene and the (31,4- 
galactosyltransf erase gene, and the cpslbl gene with the 
cpslal gene and the pi , 3-itf-acetylgalactosaminyltransf erase 
gene. Accordingly, it was confirmed that the DNA having 
the nucleotide sequence represented by SEQ ID NO: 3 contains 
a part of the capsular polysaccharide biosynthesis genes in 
Streptococcus agalactiae Type lb. 
(0079) 

Also, it was considered that the cpsIbJ gene is a 
gene comprising a DNA encoding a protein having a 
pi, 3-galactosyltransf erase activity because the protein 
encoded by the cpsIbJ gene having the nucleotide sequence 
represented by SEQ ID NO: 2 has a preserved sequence of a 
glycosyltransf erase but its homology with the cpsIaJ gene 
is low, a cpsIaJ gene product takes a role in transferring 
galactose in the capsular polysaccharide biosynthesis in 
Streptococcus agalactiae Type la (J . Bacterid., 181: 5176 
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(1999)), and the linkages of galactose of capsular 
polysaccharide in Streptococcus agalactiae Type la and 
Streptococcus agalactiae Type lb are different, which are 
(31,4 and pi, 3, respectively (J. Bacterid. , 181 1 5176 
(1999)). The amino acid sequence of the protein encoded by 
this DNA is shown in SEQ ID N0:1. 

Example 2 Construction of a strain expressing 
pi , 3-galactosyltransf erase gene : 

A DNA containing the galactosyltransf erase gene 
obtained in (3) of Example 1 was amplified by the following 
method using DNAs having the nucleotide sequences 
represented by SEQ ID NOs : 6 and 7 synthesized using DNA 
Synthesizer Model 8905 manufactured by Perceptive 
Biosystems . 
(0080) 

PCR was carried out using these synthesized DNAs as 
a primer set and the Streptococcus agalactiae Type lb 
chromosomal DNA as the template. Using 4 0 jxl of a reaction 
solution containing 0.1 \ig of the chromosomal DNA, 0.5 
^mol/1 each primer, 2.5 units of TaKaRa LA Taq polymerase 
(manufactured by Takara Shuzo), 4 ^1 of a buffer solution 
for TaKaRa LA Taq polymerase and 2 00 |jmol/l each deoxyNTP, 
PCR was carried out by repeating a step of 94°C for 1 
minute, 42°C for 2 minutes and 72°C for 3 minutes 30 times. 
(0081) 
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After confirming amplification of the fragment of 
interest by subjecting 1/10 volume of the reaction solution 
to agarose gel electrophoresis, the amplified fragment was 
recovered from the remaining reaction solution using 
GeneClean II Kit (manufactured by Bio 101) and then 
dissolved in TE buffer (10 mmol/1 Tris-HCl and 1 mmol/1 
EDTA (pH 8.0)) to obtain 20 jxl of the DNA solution. 

Using 5 \il of the dissolved solution , the DNA was 
digested with restriction enzymes Not I and XhoZ, the 
resulting DNAs were separated using agarose gel 
electrophoresis and then a DNA of 2.0 kb was recovered 
using GeneClean II Kit. 
(0082) 

After 0.2 \xg of pBluescript II SK( + ) DNA was 
digested with restriction enzymes Notl and Xhol, the 
resulting DNAs were separated by agarose gel 
electrophoresis and then a DNA of 3.0 kb was recovered 
using GeneClean II Kit. 

Using a ligation kit, the 2.0 kb and 3.0 kb 
fragments were ligated at 16°C for 16 hours. 
(0083) 

Using the ligation reaction solution, E. coll JM109 
was transformed in accordance with the method known by 
persons having ordinary skill in the art, and the 
transf ormants were spread on LB agar medium (10 g/1 Bacto 
Tryptone (manufactured by Difco), 10 g/1 Yeast Extract 
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(manufactured by Difco), 5 g/1 sodium chloride, 15 g/1 
agar) containing 50 jxg/ml ampicillin, followed by culturing 
overnight at 37°C. 

By extracting plasmid from the thus grown 
transformant colonies in accordance with the method known 
by persons having ordinary skill in the art, an expression 
plasmid pBBPIJ was obtained. The structure of this plasmid 
was confirmed by restriction enzyme digestion (Fig. 2). 
(0084) 

In the same manner, -PCR was carried out using DNAs 
having the nucleotide sequences represented by SEQ ID NOs:7 
and 8 synthesized using DNA Synthesizer Model 8905 
manufactured by Perceptive Biosystems as a primer set and 
the Streptococcus agalactiae type lb chromosomal DNA as the 
template . 

After confirming amplification of the fragment of 
interest by subjecting 1/10 volume of the reaction solution 
to agarose gel electrophoresis, the amplified fragment was 
recovered from the remaining reaction solution using 
GeneClean II Kit (manufactured by Bio 101) to obtain 20 \xl 
TE solution of the DNA. 
(0085) 

Using 5 \il of the solution, the DNA was digested 
with restriction enzymes EcoRI and Xhol, the resulting DNAs 
were separated by agarose gel electrophoresis and then a 
DNA of 1.0 kb was recovered using GeneClean II Kit. 
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After 0.2 \xg of pBluescript II SK( + ) DNA was 
digested with restriction enzymes EcoRl and Xhol, the 
resulting DNAs were separated by agarose gel 
electrophoresis and then a DNA of 3.0 kb was recovered 
using GeneClean II Kit. 
(0086) 

Using a ligation kit, the 1.0 kb and 3.0 kb 
fragments were ligated at 16°C for 16 hours. 

Using the ligation reaction solution, E. coll JM109 
was transformed in accordance with the < method known by 
persons having ordinary skill in the art, and the 
transf ormants were spread on LB agar medium containing 50 
\xg/ml ampicillin, followed by culturing overnight at 37°C. 
(0087) 

By extracting plasmid from the thus grown 
transformant colonies in accordance with the method known 
by persons having ordinary skill in the art, an expression 
plasmid pBBPJ was obtained. The structure of this plasmid 
was confirmed by restriction enzyme digestion (Fig. 2). 

Escherichia coli JMl09/pBBPJ having the plasmid 
pBBPJ containing a DNA encoding a protein having a 
(31 , 3-galactosyltransf erase activity derived from 

Streptococcus agalactiae Type lb has been deposited as FERM 
BP-7400 on December 21 , 2000 in National Institute of 
Bioscience and Human-Technology, Agency of Industrial 
Science and Technology (1-3, Higashi 1-chome, Tsukuba-shi, 
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Ibaraki-ken, 305-8566 Japan) (present name and address: 
International Patent Organism Depositary/ National 
Institute of Advanced Industrial Science and Technology 
(AIST Tsukuba Central 6, 1-1, Higashi 1-chome Tsukuba-shi, 
Ibaraki-ken, 305-8566 Japan)). 
Example 3 Production of lacto-i\T-tetraose : 

Each of the Escherichia coli JMl09/pBBPIJ and 
JM109/pBBPJ obtained in Example 2 was inoculated into a 
test tube charged with 8 ml of LB medium containing 50 
|ng/ml ampicillin respectively, followed by culturing- at 
37°C for 17 hours. The culture was inoculated at 1% into a 
test tube charged with 8 ml of LB medium containing 50 
^ig/ml ampicillin respectively, followed by culturing at 
37°C for 5 hours, and then IPTG was added thereto to give a 
concentration of 1 mmol/1. Two hours after additional 
culturing, wet cells were obtained by centrif ugation . A 
membrane fraction was prepared from the wet cells in 
accordance with the method known by persons having ordinary 
skill in the art (J. Biol. Chem. , 272: 19502 (1997), Mol . 
Microbiol., 26: 197 (1997)). Since this membrane fraction 
can be stored at -80°C, if necessary, it was able to use it 
by thawing prior to use. 
(0088) 

Lacto-AT-triose II to be used as the acceptor 
carbohydrate was prepared by allowing lacto-I\T-neotetraose 
(manufactured by Sigma) to react with p-galactosidase 
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(manufactured by Seikagaku Corporation), completely 
removing the non-reducing terminal galactose and then 
inactivating the p-galactosidase activity by heat treatment 
at 100°C for 5 minutes. 

The reaction was carried out at 37°C for 72 hours 
in 0.1 ml of a reaction solution containing the 
JMl09/pBBPIJ membrane fraction (200 jxg/ml), 50 mmol/1 
citrate buffer (pH 7.0), 5 mmol/1 MgCl 2 , 10 mmol/1 lacto- 
W-triose II and 5 mmol/1 UDP-galactose . 
(0089) 

After completion of the reaction, the reaction 
product was analyzed using a High-performance anion- 
exchange chromatography with pulsed amperometric detection 
system manufactured by Dionex (DX-500) under the following 
analyzing conditions to confirm that 0.2 mmol/1 lacto- : . 
N-tetraose was produced and accumulated in the reaction 
solution . 

In the same manner, it was confirmed that 0.05 
mmol/1 lacto-N-tetraose was produced and accumulated when a 
JMl09/pBBPJ membrane fraction was used. 
Analyzing conditions: 

Column: CarboPAC PA10 

Eluent: A; H 2 0, B; 500 mmol/1 NaOH 

Gradient: 10 to 70% B in 20 min 
Detector: Pulsed amperometric detector 

(0090) 
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(Effect of the invention) 

The present invention provides a protein having a 
(31 , 3-galactosyltransf erase activity derived from a 
microorganism having an activity of transferring galactose 
to N-acetylglucos amine with pi , 3-linkage . A galactose- 
containing carbohydrate can be efficiently produced by 
using the enzyme. 
(0091) 

(Free text of sequence listing) 
SEQ ID NO: 4: Synthetic DNA 
SEQ ID NO: 5: Synthetic DNA 
SEQ ID NO: 6: Synthetic DNA 
SEQ ID NO: 7: Synthetic DNA 
SEQ ID NO: 8: Synthetic DNA 
(0092) 

( Sequence listing ) 

SEQUENCE LISTING 

<110> KYOWA HAKKO KOGYO CO., LTD. 

<120> Beta 1, 3-galactosyltransf erase and a DNA coding for said enzyme 

<130> 11328 

<140> 
<141> 

<160> 8 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 
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<212> PRT 

<213> Streptococcus agalactiae Type lb 
<400> 1 

Met Asn Tyr Ser He He Met Ser Val Tyr Asn Glu Pro Leu Asn Tyr 
15 10 15 

Val Arg Asp Ser Val Glu Ser He Leu Asn Gin Thr Leu Thr Asp Phe 
20 25 30 

Glu Phe He He Val He Asp Asn Pro Ser Arg Gly Asp Leu Lys Gin 
35 40 45 

Phe Leu Thr Glu Tyr Ser Val Val Asp Asn Arg He Lys He Leu Leu 
50 55 60 

Asn Glu Glu Asn He Gly Leu Ala Ser Ser Leu Asn Lys Ala Val Lys 
65 70 75 80 

He Ser Lys Gly Glu Tyr lie Phe Arg Met Asp Ala Asp Asp He Ser 
85 90 95 

Tyr Pro Ser Arg Phe Asp Lys Gin He Arg Phe Met Glu Glu Asn Ser 
100 105 110 

Leu Asp Phe Ser Ala Thr Leu He Glu Leu He Asp Gin Lys Gly Asn 
115 120 125 

Leu Val Tyr Lys Gin Arg Glu Ser Asn Lys He Tyr Leu Thr Asn Asp 
130 135 140 

He Arg Lys Met Leu Leu Asn Arg Ser He Leu Ala His Pro Thr Trp 
145 150 155 160 

Cys Val Lys Lys Lys Val Phe Asp Lys Leu Met Gly Tyr Arg Asp Leu 
165 170 175 

Val Pro Val Glu Asp Tyr Asp Phe Ala He Arg Gly Ala Leu Ala Asp 
180 185 190 

Phe Lys He Gly Leu Leu Asn Lys Val Leu Leu Gin Tyr Arg Leu Asn 
195 200 205 

Glu Asn Gly He Ser Gin Thr Asn Lys Phe Lys Gin Tyr He Tyr Ser 
210 215 220 

Ala He Leu Gin Asp Phe Tyr Lys Glu Lys Ser Tyr He Asp lie Thr 
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225 230 235 240 

Lys He Thr Asn Tyr Phe Gin Glu Tyr Val He Lys Lys Arg Tyr Thr 
245 250 255 

Gin Gin Glu Leu Ser Lys Tyr Phe Glu Leu Lys Ser Thr Pro Ser He 
260 265 270 

Thr He Arg Lys Leu Tyr He Cys Leu Tyr Leu Tyr Phe Lys Ser Pro 
275 280 285 

Leu Val Arg Arg Leu Leu He Asn Asp lie Asn He Leu Val Leu Lys 
290 295 300 

Leu Phe Gly Gly Glu Lys Gin Ser Asp 
305 310 313 

<210> 2 
<211> 
<212> DNA 

<213> Streptococcus agalactiae Type lb 
<400> 2 

atg aat tat agt ate att atg teg gta tat aat gag cct tta aat tat 48 
Met Asn Tyr Ser He He Met Ser Val Tyr Asn Glu Pro Leu Asn Tyr 
15 10 15 

gtg aga gat tea gta gaa tct ata tta aat caa acg ctt act gat ttt 96 
Val Arg Asp Ser Val Glu Ser He Leu Asn Gin Thr Leu Thr Asp Phe 
20 25 30 

gag ttc ata att gtc att gat aat cca agt aga ggt gat tta aag caa 144 
Glu Phe He He Val He Asp Asn Pro Ser Arg Gly Asp Leu Lys Gin 
35 40 45 

ttc tta aca gaa tat tea gtt gta gat aat aga ata aaa ate ttg ctt 192 
Phe Leu Thr Glu Tyr Ser Val Val Asp Asn Arg He Lys He Leu Leu 
50 55 60 

aat gaa gaa aat att ggt tta gca tea agt ttg aac aaa gcg gtg aaa 240 
Asn Glu Glu Asn He Gly Leu Ala Ser Ser Leu Asn Lys Ala Val Lys 
65 70 75 80 

att tct aag gga gaa tat att ttt aga atg gat get gat gat att tea 288 
He Ser Lys Gly Glu Tyr He Phe Arg Met Asp Ala Asp Asp He Ser 
85 90 95 
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tat cca agt aga ttt gat aag caa att cgt ttt atg gag gaa aat tea 336 
Tyr Pro Ser Arg Phe Asp Lys Gin He Arg Phe Met Glu Glu Asn Ser 
100 105 110 

ttg gat ttc tea gca act eta ata gaa ttg ata gac caa aaa gga aat 384 
Leu Asp Phe Ser Ala Thr Leu lie Glu Leu He Asp Gin Lys Gly Asn 
115 120 125 

tta gta tat aaa caa cga gaa agt aat aaa ata tac tta act aat gat 432 
Leu Val Tyr Lys Gin Arg Glu Ser Asn Lys He Tyr Leu Thr Asn Asp 
130 135 140 

ata egg aag atg tta ttg aat aga tct ata ctt gee cac cca acg tgg 480 
He Arg Lys Met Leu Leu Asn Arg Ser He Leu Ala His Pro Thr Trp 
145 150 155 160 

tgc gta aaa aag aaa gtt ttc gat aag tta atg gga tat aga gat tta 528 
Cys Val Lys Lys Lys Val Phe Asp Lys Leu Met Gly Tyr Arg Asp Leu 
165 170 175 

gta cct gtt gaa gat tat gat ttt gca ata aga gga get ctg get gat 576 
Val Pro Val Glu Asp Tyr Asp Phe Ala He Arg Gly Ala Leu Ala Asp 
180 185 190 

ttc aaa ate ggc tta etc aat aaa gta ctt tta cag tat aga tta aac 624 
Phe Lys He Gly Leu Leu Asn Lys Val Leu Leu Gin Tyr Arg Leu Asn 
195 200 205 

gag aat gga ata tea caa ace aat aag ttt aag caa tat att tac tea 672 
Glu Asn Gly He Ser Gin Thr Asn Lys Phe Lys Gin Tyr He Tyr Ser 
210 215 220 

get att tta caa gat ttt tat aaa gaa aaa tct tat att gat ate aca 720 
Ala He Leu Gin Asp Phe Tyr Lys Glu Lys Ser Tyr He Asp He Thr 
225 230 235 240 

aaa att act aat tac ttt caa gag tat gtg ata aag aaa cgc tat act 768 
Lys He Thr Asn Tyr Phe Gin Glu Tyr Val He Lys Lys Arg Tyr Thr 
245 250 255 

cag caa gag etc tct aaa tat ttt gag eta aaa tct ace cct agt att 816 
Gin Gin Glu Leu Ser Lys Tyr Phe Glu Leu Lys Ser Thr Pro Ser He 
260 265 270 

act att aga aaa eta tat att tgt tta tat tta tac ttt aag tct ccc 864 
Thr He Arg Lys Leu Tyr He Cys Leu Tyr Leu Tyr Phe Lys Ser Pro 
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275 



280 



285 



ttg gtt agg agg tta tta ata aat gat att aat att tta gta ctg aaa 912 

Leu Val Arg Arg Leu Leu lie Asn Asp He Asn He Leu Val Leu Lys 
290 295 300 

ttg ttt gga gga gag aaa caa agt gac 939 

Leu Phe Gly Gly Glu Lys Gin Ser Asp 
305 310 



<210> 3 
<211> 6865 
<212> DNA 

<213> Streptococcus agalactiae type lb 

<220> 
<221> CDS 

<222> (617).. (1789) 

<220> 
<221> CDS 

<222> (1816).. (2262) 

<220> 
<221> CDS 

<222> (2265).. (2744) 

<220> 
<221> CDS 

<222> (2843).. (3979) 

<220> 
<221> CDS 

<222> (3982).. (4953) 

<220> 
<221> CDS 

<222> (5009).. (5947) 
<400> 3 

agatcttgga gatattatct gtgaaaccaa tgttcctaga ctgatggtcg ttccttcagg 60 
gaaagtacca ccaaatccaa cagcattact tcagaacgct tattttaata agatgattga 120 



agctattaaa aatatatttg attatattat catcgatact ccacctattg gtttagttgt 180 



tgatgccgca 


ataatcgcta 


atgcttgcga 


tggttttatt 


ttagtaaccc 


aagcaggtag 240 


aataaaacgt 


aattatgttg 


aaaaagcaaa 


agaacagatg 


gaacaaagtg 


gttcaaagtt 300 


cttaggtatt 


attcttaata 


aagttaatga 


atctgttgct 


acttacggcg 


attatggaaa 360 


ttacggaaaa 


agggatagaa 


aaaggaagta 


aggggctctt 


gtattgaaag 


aaaaagaaaa 420 


tatacaaaag 


attattatag 


cgatgattca 


aaccgttgtg 


gtttattttt 


ctgcaagttt 480 


gacattaaca 


ttaattactc 


ccaactttaa 


aagcaataaa 


gatttattgt 


ttgttctatt 540 


gatacattat 


attgtctttt 


atctttctga 


tttttacaga 


gacttttgga 


gtcgtggcta 600 



tcttgaagag tttaaa atg gta ttg aaa tac age ttt tac tat att ttc ata 652 
Met Val Leu Lys Tyr Ser Phe Tyr Tyr He Phe He 
1 5 10 



tea agt tea tta ttt ttt att tct aaa aac tct ttt aca acg aca cga 700 
Ser Ser Ser Leu Phe Phe He Ser Lys Asn Ser Phe Thr Thr Thr Arg 
15 20 25 

ctt tec ttt ttt act ttt att get atg aat teg att tta tta tat eta 748 
Leu Ser Phe Phe Thr Phe He Ala Met Asn Ser He Leu Leu Tyr Leu 
30 35 40 

ttg aat tea ttt tta aaa tat tat cga aaa tat tct tac get aag ttt 796 
Leu Asn Ser Phe Leu Lys Tyr Tyr Arg Lys Tyr Ser Tyr Ala Lys Phe 
45 50 55 60 

tea cga gat ace aaa gtt gtt ttg ata acg aat aag gat tct tta tea 844 
Ser Arg Asp Thr Lys Val Val Leu He Thr Asn Lys Asp Ser Leu Ser 
65 70 75 

aaa atg acc ttt agg aat aaa tac gac cat aat tat ate get gtc tgt 892 
Lys Met Thr Phe Arg Asn Lys Tyr Asp His Asn Tyr He Ala Val Cys 
80 85 90 

ate ttg gat tec tct gaa aag gat tgt tat gat ttg aaa cat aac teg 940 
He Leu Asp Ser Ser Glu Lys Asp Cys Tyr Asp Leu Lys His Asn Ser 
95 100 105 

tta agg ata ata aac aaa gat get ctt act tea gag tta acc tgc tta 988 
Leu Arg He He Asn Lys Asp Ala Leu Thr Ser Glu Leu Thr Cys Leu 
110 115 120 
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act gtt gat caa get ttt att aac ata ccc att gaa tta ttt ggt aaa 1036 
Thr Val Asp Gin Ala Phe He Asn He Pro He Glu Leu Phe Gly Lys 
125 130 135 140 

tac caa ata caa gat att att aat gac att gaa gca atg gga gtg att 1084 
Tyr Gin He Gin Asp He He Asn Asp He Glu Ala Met Gly Val He 
145 150 155 

gtc aat gtt aat gta gag gca ctt age ttt gat aat ata gga gaa aag 1132 
Val Asn Val Asn Val Glu Ala Leu Ser Phe Asp Asn lie Gly Glu Lys 
160 165 170 

cga ate caa act ttt gaa gga tat agt gtt att aca tat tct atg aaa 1180 
Arg lie Gin Thr Phe Glu Gly Tyr Ser Val He Thr Tyr Ser Met Lys 
175 180 185 

ttc tat aaa tat agt cac ctt ata gca aaa cga ttt ttg gat : atc atg 1228 
Phe Tyr Lys Tyr Ser His Leu He Ala Lys Arg Phe Leu Asp He Met 
190 195 200 

ggt get att ata ggt ttg etc ata tgt ggc att gtg gca att ttt eta 1276 
Gly Ala He He Gly Leu Leu He Cys Gly He Val Ala He Phe Leu 
205 210 215 220 

gtt ccg caa ate aga aaa gat ggt gga ccg get ate ttt tct caa aat 1324 
Val Pro Gin He Arg Lys Asp Gly Gly Pro Ala He Phe Ser Gin Asn 
225 230 235 

aga gta ggt cgt aat ggt agg att ttt aga ttc tat aaa ttc aga tea 1372 
Arg Val Gly Arg Asn Gly Arg He Phe Arg Phe Tyr Lys Phe Arg Ser 
240 245 250 ' 

atg cga gta gat gca gaa caa att aag aaa gat tta tta gtt cac aat 1420 
Met Arg Val Asp Ala Glu Gin He Lys Lys Asp Leu Leu Val His Asn 
255 260 265 

caa atg acg ggg eta atg ttt aag tta gac gat gat cct aga att act 1468 
Gin Met Thr Gly Leu Met Phe Lys Leu Asp Asp Asp Pro Arg He Thr 
270 275 280 

aaa ata gga aaa ttt att cga aaa aca age ata gat gag ttg cct caa 1516 
Lys He Gly Lys Phe He Arg Lys Thr Ser He Asp Glu Leu Pro Gin 
285 290 295 300 

ttc tat aat gtt tta aaa ggt gat atg agt tta gta gga aca cgc cct 1564 
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Phe Tyr Asn Val Leu Lys Gly Asp Met Ser Leu Val Gly Thr Arg Pro 
305 310 315 



ccc aca gtt gat gaa tat gaa aag tat aat tea acg cag aag cga cgc 1612 
Pro Thr Val Asp Glu Tyr Glu Lys Tyr Asn Ser Thr Gin Lys Arg Arg 
320 325 330 

ctt agt ttt aag cca gga ate act ggt ttg tgg caa ata tct ggt aga 1660 
Leu Ser Phe Lys Pro Gly He Thr Gly Leu Trp Gin He Ser Gly Arg 
335 340 345 

aat aat att act gat ttt gat gaa ate gta aag tta gat gtt caa tat 1708 
Asn Asn He Thr Asp Phe Asp Glu He Val Lys Leu Asp Val Gin Tyr 
350 355 360 

ate aat gaa tgg tct att tgg tea gat att aag att att etc eta acg 1756 
lie Asn Glu Trp Ser lie Trp Ser Asp He Lys lie He Leu Leu Thr 
365 370 375 380 

eta aag gta gtt tta etc ggg aca gga get aag taaaggtaag gtttgaaagg 1809 
Leu Lys Val Val Leu Leu Gly Thr Gly Ala Lys 
385 390 

aatata atg aaa att tgt ctg gtt ggt tea agt ggt ggt cac eta gca 1857 
Met Lys lie Cys Leu Val Gly Ser Ser Gly Gly His Leu Ala 
395 400 405 

cac ttg aac ctt ttg aaa ccc att tgg gaa aaa gaa gat agg ttt tgg 1905 
His Leu Asn Leu Leu Lys Pro He Trp Glu Lys Glu Asp Arg Phe Trp 
410 415 420 

gta act ttt gat aaa gaa gat get agg agt att eta aga gaa gag att 1953 
Val Thr Phe Asp Lys Glu Asp Ala Arg Ser He Leu Arg Glu Glu He 
425 430 435 

gta tat cat tgc ttc ttt cca aca aac cgt aat gtc aaa aac ttg gta 2001 
Val Tyr His Cys Phe Phe Pro Thr Asn Arg Asn Val Lys Asn Leu Val 
440 445 450 

aaa aat act att eta get ttt aag gtc ctt aga aaa gaa aga cca gat 2049 
Lys Asn Thr He Leu Ala Phe Lys Val Leu Arg Lys Glu Arg Pro Asp 
455 460 465 

gtt ate ata tea tct ggt gec get gta gca gta cca ttc ttt tat att 2097 
Val He He Ser Ser Gly Ala Ala Val Ala Val Pro Phe Phe Tyr He 
470 475 480 485 
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ggt aag tta ttt ggc tgt aag acc gtt tat ata gag gtt ttc gac agg 2145 
Gly Lys Leu Phe Gly Cys Lys Thr Val Tyr He Glu Val Phe Asp Arg 
490 495 500 

ata gat aaa cca act ttg aca gga aaa tta gtg tat cct gta aca gat 2193 
He Asp Lys Pro Thr Leu Thr Gly Lys Leu Val Tyr Pro Val Thr Asp 
505 510 515 

aaa ttt att gtt cag tgg gaa gaa atg aaa aaa gtt tat cct aag gca 2241 
Lys Phe He Val Gin Trp Glu Glu Met Lys Lys Val Tyr Pro Lys Ala 
520 525 530 

att aat tta gga gga att ttt ta atg att ttt gtc aca gta ggg aca 2288 
He Asn Leu Gly Gly He Phe Met He Phe Val Thr Val Gly Thr 
535 540 545 

cat gaa cag cag ttc aac cgt ctt att aaa gaa gtt gat aga tta aaa 2336 
His Glu Gin Gin Phe Asn Arg Leu He Lys Glu Val Asp Arg Leu Lys 
550 555 560 

ggg aca ggt get att gat caa gaa gtg ttc att caa acg ggt tac tea 2384 
Gly Thr Gly Ala He Asp Gin Glu Val Phe He Gin Thr Gly Tyr Ser 
565 570 575 580 

gac ttt gaa cct cag aat tgt cag tgg tea aaa ttt etc tea tat gat 2432 
Asp Phe Glu Pro Gin Asn Cys Gin Trp Ser Lys Phe Leu Ser Tyr Asp 
585 590 595 

gat atg aac tct tac atg aaa gaa get gag att gtt ate aca cac ggc 2480 
Asp Met Asn Ser Tyr Met Lys Glu Ala Glu He Val He Thr His Gly 
600 605 610 

ggt cca gca acg ttt atg aat gca gtt tct aaa ggg aaa aaa act att 2528 
Gly Pro Ala Thr Phe Met Asn Ala Val Ser Lys Gly Lys Lys Thr He 
615 620 625 

gtg gtt cct aga caa gaa cag ttt gga gag cat gtg aat aat cat cag 2576 
Val Val Pro Arg Gin Glu Gin Phe Gly Glu His Val Asn Asn His Gin 
630 635 640 

gtg gat ttt ttg aaa gag tta ttc ttg aaa tat gag tta gat tat att 2624 
Val Asp Phe Leu Lys Glu Leu Phe Leu Lys Tyr Glu Leu Asp Tyr lie 
645 650 655 660 

ttg aat ate agt gaa tta gag aat att att aag gaa aaa aat ata tct 2672 
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Leu Asn He Ser Glu Leu Glu Asn He lie Lys Glu Lys Asn lie Ser 
665 670 675 



act agt aaa gta ata tea caa aac aat gat ttt tgt tec tct ttc aaa 2720 
Thr Ser Lys Val He Ser Gin Asn Asn Asp Phe Cys Ser Ser Phe Lys 
680 685 690 

aat gaa ctt tct aaa eta ttt gaa taaatatatt ttgttggaga aaaaaattga 2774 
Asn Glu Leu Ser Lys Leu Phe Glu 
695 700 

aattaactat caatccaaag tatttgttaa taggaggaat tttegcttta accctatttt 2834 

caaageca atg caa ctt ttg tta ctt tta gca tta ata gtt tta ctt att 2884 
Met Gin Leu Leu Leu Leu Leu Ala Leu He Val Leu Leu He 
705 710 

tgt agt agt tat aat gaa aaa atg aaa ttt tta aat atg get gaa att 2932 
Cys Ser Ser Tyr Asn Glu Lys Met Lys Phe Leu Asn Met Ala Glu He 
715 720 725 730 

ttt ttc att gta ttt tat atg gtt tat tta gta tea ata gta tta aat 2980 
Phe Phe He Val Phe Tyr Met Val Tyr Leu Val Ser He Val Leu Asn 
735 740 745 

teg tta ttt aga agt cca gaa ttt cat aga gtc att get gca ttc aat 3028 
Ser Leu Phe Arg Ser Pro Glu Phe His Arg Val He Ala Ala Phe Asn 
750 755 760 

tea ctg gca gta ggg gtt gtg tec tta tta ttt tac cat tac tat aag 3076 
Ser Leu Ala Val Gly Val Val Ser Leu Leu Phe Tyr His Tyr Tyr Lys 
765 770 775 

aat act aat att gaa tta aca aaa ttg eta aaa tea ttt ttg ttt aat 3124 
Asn Thr Asn He Glu Leu Thr Lys Leu Leu Lys Ser Phe Leu Phe Asn 
780 785 790 

gca att att ttg ttt tgt tta gga ttt eta tat tat tat gee ata tat 3172 
Ala He He Leu Phe Cys Leu Gly Phe Leu Tyr Tyr Tyr Ala He Tyr 
795 800 805 810 

ttt gat gta gag aat gta agt ctt ttt gga aga aat tta att gga tea 3220 
Phe Asp Val Glu Asn Val Ser Leu Phe Gly Arg Asn Leu He Gly Ser 
815 820 825 

gat tgg ata aat ggg atg cat acg cag aga gca atg get ttc ttt gaa 3268 
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Asp Trp He Asn Gly Met His Thr Gin Arg Ala Met Ala Phe Phe Glu 
830 835 840 



tat tea aat ctt ata ata ccc tta act ate ata act aat ata tat ata 3316 
Tyr Ser Asn Leu He He Pro Leu Thr He He Thr Asn He Tyr He 
845 850 855 

tat ata tat att aag caa aga tat age tea ggg atg atg ata etc ggt 3364 
Tyr He Tyr He Lys Gin Arg Tyr Ser Ser Gly Met Met He Leu Gly 
860 865 870 

get ctt etc tec act att ata eta ccc ate ggg tct gga tct aga get 3412 
Ala Leu Leu Ser Thr He He Leu Pro He Gly Ser Gly Ser Arg Ala 
875 880 885 890 

ggt att ata gtt gtg eta eta cag gtt ata att tta ttg ttg aat aca 3460 
Gly He He Val Val Leu Leu Gin Val He He Leu Leu Leu Asn Thr 
895 900 905 

att gta ata aaa aga caa acg ata aga ttt ttc ctg tat tta gtt ccg 3508 
lie Val lie Lys Arg Gin Thr He Arg Phe Phe Leu Tyr Leu Val Pro 
910 915 920 

ata eta ata tta eta tta gtg ata tta cgt ttt gat aat ttg gtg age 3556 
He Leu He Leu Leu Leu Val He Leu Arg Phe Asp Asn Leu Val Ser 
925 930 935 

ata tat aat aga ata ate aat ttg egg teg gga agt agt gaa tct aga 3604 
He Tyr Asn Arg He He Asn Leu Arg Ser Gly Ser Ser Glu Ser Arg 
940 945 950 

ttt tct ttg tac aag gat acc gta cac tea gta att act gac tea eta . 3652 
Phe Ser Leu Tyr Lys Asp Thr Val His Ser Val He Thr Asp Ser Leu 
955 960 965 970 

ttt ctg gga aaa ggt gta aaa gaa ttg tgg tta aat agt gat tta cca 3700 
Phe Leu Gly Lys Gly Val Lys Glu Leu Trp Leu Asn Ser Asp Leu Pro 
975 980 985 

eta gga teg cat teg acc tac ata ggt tat ttc tat aaa act ggc eta 3748 
Leu Gly Ser His Ser Thr Tyr He Gly Tyr Phe Tyr Lys Thr Gly Leu 
990 995 1000 

ttt gga eta ata aat gtg att tta ggt ttg ttt eta att ctt att age 3796 
Phe Gly Leu He Asn Val He Leu Gly Leu Phe Leu He Leu He Ser 
1005 1010 1015 
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att ate aag gaa get aaa aag tea gat ttc tat tat gag ata gta ggg 
He He Lys Glu Ala Lys Lys Ser Asp Phe Tyr Tyr Glu He Val Gly 
1020 1025 1030 



3844 



tct gtc ata etc eta ttt tea ttt ttt gca ctt gaa gat att gat ggc 
Ser Val He Leu Leu Phe Ser Phe Phe Ala Leu Glu Asp lie Asp Gly 
1035 1040 1045 1050 



3892 



gee aat tgg etc att att ttt gtc ttt aca gtg ttg gga att tta gaa 
Ala Asn Trp Leu He He Phe Val Phe Thr Val Leu Gly He Leu Glu 
1055 1060 1065 



3940 



aat aag gat ttc tat agt caa ctt aaa agg tgg gaa agt ta atg gaa 
Asn Lys Asp Phe Tyr Ser Gin Leu Lys Arg Trp Glu Ser Met Glu 
1070 1075 1080 



3987 



aaa caa ata ctt gtt tct ate gtt ata cct ata tac aac teg gaa gca 
Lys Gin He Leu Val Ser He Val He Pro He Tyr Asn Ser Glu Ala 
1085 1090 1095 



4035 



tat ctt aaa gaa tgc gtg caa tec gtc eta caa cag act cat tea ttg 
Tyr Leu Lys Glu Cys Val Gin Ser Val Leu Gin Gin Thr His Ser Leu 
1100 1105 1110 



4083 



ata gaa gtt ata ctg att aat gat gga tec act gat aat agt gga gaa 
He Glu Val He Leu He Asn Asp Gly Ser Thr Asp Asn Ser Gly Glu 
1115 1120 1125 



4131 



att tgt gat aat tta tct caa aaa gac gat cgc ata ctt gta ttt cat 
He Cys Asp Asn Leu Ser Gin Lys Asp Asp Arg lie Leu Val Phe His 
1130 1135 1140 1145 



4179 



aaa aaa aat gga ggg gta tct teg gca agg aac eta ggt ctt gat aaa 
Lys Lys Asn Gly Gly Val Ser Ser Ala Arg Asn Leu Gly Leu Asp Lys 
1150 1155 1160 



4227 



tec aca ggc gaa ttc ata acg ttt gta gat agt gat gat ttt gta gca 
Ser Thr Gly Glu Phe He Thr Phe Val Asp Ser Asp Asp Phe Val Ala 
1165 1170 1175 



4275 



ccg aat ata att gaa ata atg tta aaa aat tta ate act gag gat get 
Pro Asn He He Glu He Met Leu Lys Asn Leu He Thr Glu Asp Ala 
1180 1185 1190 



4323 



gat ata gca gaa gta gat ttt gat att teg aat gag aga gat tat aga 4371 
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Asp He Ala Glu Val Asp Phe Asp He Ser Asn Glu Arg Asp Tyr Arg 
1195 1200 1205 



aag aaa aaa aga cga aac ttt tat aag gtc ttt aaa aac aat aat tct 4419 
Lys Lys Lys Arg Arg Asn Phe Tyr Lys Val Phe Lys Asn Asn Asn Ser 
1210 1215 1220 1225 

tta aaa gaa ttt tta tea ggt aat aga gtg gaa aat att gtt tgt aca 4467 
Leu Lys Glu Phe Leu Ser Gly Asn Arg Val Glu Asn He Val Cys Thr 
1230 1235 1240 

aaa tta tat aaa aaa agt ata att ggt aac ttg agg ttt gat gag aat 4515 
Lys Leu Tyr Lys Lys Ser He He Gly Asn Leu Arg Phe Asp Glu Asn 
1245 1250 1255 

tta aaa att ggt gag gat tta ctt ttt aat tgt aaa att tta tgt caa 4563 
Leu Lys He Gly Glu Asp Leu Leu Phe Asn Cys Lys lie Leu Cys Gin 
1260 1265 1270 

gag cac tgc ata gtc gta gat acg act tct tec ttg tac acc tat cgc 4611 
Glu His Cys He Val Val Asp Thr Thr Ser Ser Leu Tyr Thr Tyr Arg 
1275 1280 1285 

ate gta aag act tct gca atg aat cag gag ttc aac gaa aat tea tta 4659 
He Val Lys Thr Ser Ala Met Asn Gin Glu Phe Asn Glu Asn Ser Leu 
1290 1295 1300 1305 

gat ttt ata aca att ttt aat gaa ata age agt att gtt cct gca aaa 4707 
Asp Phe He Thr He Phe Asn Glu He Ser Ser He Val Pro Ala Lys 
1310 1315 1320 

tta get aat tat gtt gaa gcg aaa ttt tta aga gaa aag gta aag tgt 4755 
Leu Ala Asn Tyr Val Glu Ala Lys Phe Leu Arg Glu Lys Val Lys Cys 
1325 1330 1335 

etc cga aaa atg ttt gaa tta ggt agt aat att gac agt aaa ate aaa 4803 
Leu Arg Lys Met Phe Glu Leu Gly Ser Asn He Asp Ser Lys He Lys 
1340 1345 1350 

tta caa cga gag att ttt ttc aaa gat gtt aaa tta tac cct ttc tat 4851 
Leu Gin Arg Glu He Phe Phe Lys Asp Val Lys Leu Tyr Pro Phe Tyr 
1355 1360 1365 

aaa gcg gtt aag tac tta tea tta aag gga tta ttg agt att tac tta 4899 
Lys Ala Val Lys Tyr Leu Ser Leu Lys Gly Leu Leu Ser He Tyr Leu 
1370 1375 1380 1385 
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atg aaa tgt tea ccc ate ttg tat ata aaa tta tat gac agg ttt caa 4947 
Met Lys Cys Ser Pro He Leu Tyr He Lys Leu Tyr Asp Arg Phe Gin 
1390 1395 1400 

aaa cag taagtaatca aaaattaaat taactcaatt accttttaaa ttataggagt 5003 
Lys Gin 

tgaaa atg aat tat agt ate att atg teg gta tat aat gag cct tta aat 5053 
Met Asn Tyr Ser He He Met Ser Val Tyr Asn Glu Pro Leu Asn 
1405 1410 1415 

tat gtg aga gat tea gta gaa tct ata tta aat caa acg ctt act gat 5101 
Tyr Val Arg Asp Ser Val Glu Ser He Leu Asn Gin Thr Leu Thr Asp 
1420 1425 1430 

ttt gag ttc ata att gtc att gat aat cca agt aga ggt gat tta aag 5149 
Phe Glu Phe lie He Val He Asp Asn Pro Ser Arg Gly Asp Leu Lys 
1435 1440 1445 1450 

caa ttc tta aca gaa tat tea gtt gta gat aat aga ata aaa ate ttg 5197 
Gin Phe Leu Thr Glu Tyr Ser Val Val Asp Asn Arg He Lys He Leu 
1455 1460 1465 

ctt aat gaa gaa aat att ggt tta gca tea agt ttg aac aaa gcg gtg 5245 
Leu Asn Glu Glu Asn He Gly Leu Ala Ser Ser Leu Asn Lys Ala Val 
1470 1475 1480 

aaa att tct aag gga gaa tat att ttt aga atg gat get gat gat att 5293 
Lys He Ser Lys Gly Glu Tyr He Phe Arg Met Asp Ala Asp Asp He 
1485 1490 1495 

tea tat cca agt aga ttt gat aag caa att cgt ttt atg gag gaa aat 5341 
Ser Tyr Pro Ser Arg Phe Asp Lys Gin He Arg Phe Met Glu Glu Asn 
1500 1505 1510 

tea ttg gat ttc tea gca act eta ata gaa ttg ata gac caa aaa gga 5389 
Ser Leu Asp Phe Ser Ala Thr Leu He Glu Leu He Asp Gin Lys Gly 
1515 1520 1525 1530 

aat tta gta tat aaa caa cga gaa agt aat aaa ata tac tta act aat 5437 
Asn Leu Val Tyr Lys Gin Arg Glu Ser Asn Lys He Tyr Leu Thr Asn 
1535 1540 1545 

gat ata egg aag atg tta ttg aat aga tct ata ctt gec cac cca acg 5485 
Asp He Arg Lys Met Leu Leu Asn Arg Ser He Leu Ala His Pro Thr 
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1550 1555 1560 

tgg tgc gta aaa aag aaa gtt ttc gat aag tta atg gga tat aga gat 5533 
Trp Cys Val Lys Lys Lys Val Phe Asp Lys Leu Met Gly Tyr Arg Asp 
1565 1570 1575 

tta gta cct gtt gaa gat tat gat ttt gca ata aga gga get ctg get 5581 
Leu Val Pro Val Glu Asp Tyr Asp Phe Ala lie Arg Gly Ala Leu Ala 
1580 1585 1590 

gat ttc aaa ate ggc tta etc aat aaa gta ctt tta cag tat aga tta 5629 
Asp Phe Lys He Gly Leu Leu Asn Lys Val Leu Leu Gin Tyr Arg Leu 
1595 1600 1605 1610 

aac gag aat gga ata tea caa acc aat aag ttt aag caa tat att tac 5677 
Asn Glu Asn Gly He Ser Gin Thr Asn Lys Phe Lys Gin Tyr He Tyr 
1615 1620 1625 

tea get att tta caa gat ttt tat aaa gaa aaa tct tat att gat ate 5725 
Ser Ala He Leu Gin Asp Phe Tyr Lys Glu Lys Ser Tyr He Asp He 
1630 1635 1640 

aca aaa att act aat tac ttt caa gag tat gtg ata aag aaa cgc tat 5773 
Thr Lys He Thr Asn Tyr Phe Gin Glu Tyr Val He Lys Lys Arg Tyr 
1645 1650 1655 

act cag caa gag etc tct aaa tat ttt gag eta aaa tct acc cct agt 5821 
Thr Gin Gin Glu Leu Ser Lys Tyr Phe Glu Leu Lys Ser Thr Pro Ser 
1660 1665 1670 

att act att aga aaa eta tat att tgt tta tat tta tac ttt aag tct 5869 
He Thr He Arg Lys Leu Tyr He Cys Leu Tyr Leu Tyr Phe Lys Ser 
1675 1680 1685 1690 

ccc ttg gtt agg agg tta tta ata aat gat att aat att tta gta ctg 5917 
Pro Leu Val Arg Arg Leu Leu He Asn Asp He Asn He Leu Val Leu 
1695 1700 1705 

aaa ttg ttt gga gga gag aaa caa agt gac taatagaaaa atttatgtat 5967 
Lys Leu Phe Gly Gly Glu Lys Gin Ser Asp 
1710 1715 

gtcatactct ttatcattta ttgatttgtt tatataaaga agagatatat tcaaatttag 6027 

aaattattct ctcttcttct attcctgatg ttgataattt agagaaaaaa ttaaaatcaa 6087 
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aaacaataaa 


tatacatatt 


ttagaagaat 


tacttaaaga 


tgctggtcta 


agttatagta 


atgcaacgcc 


tattgggagg 


acactaataa 


atggtttaaa 


ttgttttact 


tactctatat 


aaaaatatat 


tcttcacaaa 


attcagccac 


aagttaattc 


attagttaat 


ttgccaaagg 


ctaggaaaga 


actttttgac 


aatgtaacag 


ttggagcagt 


aagagttagt 


attaaatcac 


ctatagataa 


agagtttatg 


agttataaca 


atttttataa 


atcaatagtc 


aatgaatata 


ttcatcctag 


agatgtagta 


gattattcca 


ttcctatgga 


aattatagag 


ttgatgttaa 


cgtccactgc 


gctggatttt 


ttaacttgtg 



<210> 4 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 4 

gggggatcca atggtattga aatacag 



ctagtggtga aagtgaagaa ttattatcag 6147 
agtttgatag taattgtttt atttttaatg 6207 
agcatggtat ttattataat ctaattgaag 6267 
ttagtcaaaa actttggaag tattatgtaa 6327 
atggattttc acgatattgt ttagggattg 6387 
atccgcgtta taaaaaattt attgaagtcc 6447 
aatatcaaaa agaaatggca ataaatcttt 6507 
cttcagtact agtattaacg cagcctctat 6567 
ataagataga aacgtccgaa gaacaattta 6627 
taaataaagg gtacaatgtt tatttaaaag 6687 
aattgccggt agagctatta ccatcaaatg 6747 
caggtcggtt cgaatgtggg ataacacatt 6807 
ttgataaaaa aataacttta gtagatct 6865 



<210> 5 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 5 

aatctgcaga cttagctcct gtcccgagt 29 
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<210> 6 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 6 

ccaagcggcc gctatagtca acttaaaagg tgg 33 



<210> 7 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 7 

cggctcgagt cccaataggc gttgcatc 28 



<210> 8 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<400> 

ccggaattcg aaaaggtaaa gtgtctccga aa 32 



(Brief explanation of the drawings) 

(Fig. 1) shows the structure of capsular 
polysaccharide biosynthesis genes in Streptococcus 
agalactiae Type la and Type lb. 

(Fig. 2) shows the construction steps of 
pi , 3-galactosyltransf erase plasmids pBBPIJ and pBBPJ. 
(Explanation of symbols) 
Viae, lac promoter 
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cpsl: 01 , 3-N-acetylglucosaminyltransf erase gene 
cpsJ: pi , 4-galactosyltransf erase gene 
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ument name) 
Drawings 

. 1) 



(Fig. 2) 




(Document name) 
Abtract 
(Abstract ) 
(Problems ) 

It is to provide a protein having a |3l,3- 
galactosyltransf erase activity, a DNA encoding the protein, 
a method for producing a protein having a 
(31 , 3-galactosyltransf erase activity using a transformant 
containing the DNA, and a method for producing a galactose- 
containing carbohydrate using the protein. 
(Means for solution) 

The present invention provides a protein having a 
pi , 3-galactosyltransf erase activity, a DNA encoding the 
protein, a recombinant DNA containing the DNA, a 
transformant containing the recombinant DNA, a method for 
producing a protein having a (31 , 3-galactosyltransf erase 
activity using the transformant, and a method for producing 
a galactose-containing carbohydrate using the transformant. 
(Selected drawing) 

None 
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7 ' D h ^-^X • -f > • ^ 7 - • ;W*oy-, DNA Cloning 1: Core Te 

chniques, A Practical Approach, Second Edition, Oxford University (1995) 
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IS&D NAi: LTH#f$£f±, B L A S T A S T A^£ffll^Tst3l Lfc 2: £ 
tc N IB?iJ#^2^£ti3i«ie?iJh'>& <i:t,60 % W±©*ilSJ<££*<r 3 D 
NA, »H<ii8 0^±®ffi^tnDNA, £ £ L < fct 9 5 % 

*I £ * 1" £ D N A § & # 3 d i: *s -e £ § o 
[0 0 16] 

^IBtSJC^StC^oT^^^-DNAh^MI-SChTlIM^^D NASftgiL 

[0017] 

[1] *^B^©D N A©iiS^ 

Type I MS £ fetf SC^tftt*. 
[0018] 

Zh\s7h=iy*Xlzm?%Wl&®%4kime>fi&[mZ.t£, J. Bactriol., 181 
, 5176 (1999)izlB»©;5an£J; t>*&*f*o 
iSHSL £r£a®;£& (#ij;L&\ * u> h • :7-d hn-;ux • >f > • ^^^7 

•S o 

#&l!8©DNA£^tf»r#fci\ * M/:7M3 yfc;*.. 713^>7?-x. Type III 
fiEDNA^ffl^t, ;W ^y-tf>f ^>3 >& N P C Rfeft tlZ =fc b f 3 
[0 0 19] 



mmm^=H 1 2- 2 1 1 a 4 g ; 9/ 

5 A*, ^fitjCtt, ZAP Express^ r5-*y->#:M, Strategies, 5, 58 (199 
2)], pBluescript II SK( + )[* h 7 * v — Nucleic Acids Res., 17, 94 

94 (1989)], Azap II (* h — ^ AgtlO, Agtl 1 [DNA Cloning, A 

Practical Approach, 1, 49 (1985)], A TriplEx ( Z O — >^ y ^ftgi) , A 
ExCell (7^$/* A • 7 7^?>>7 • ;W jTt 1 £*t*0 , pUC18[Gene, 33, 103 

(1985)]3S£&tf 
[ 0 0 2 0 ] 

z $ - £±ia -e mm u fe^Bj© d n a & u rf4§ e> *x & ^ d n 

&tim^-rtiX~ ; bm^Z>Z.£.tf-(:g&fiK M£W:ifi, Escherichia coli XL1-B 
lue MRF 5 [Xh7^y->ti Strategies, 5, 81 (1992)], Escherichia col 
i C600 [Genetics, 39, 440 (1954)], Escherichia coli Y1088 [Science, 222, 
778 (1983)], Escherichia coli Y1090 [Science, 222, 778 (1983)], Escheri 
chia coli NM522 [J. Mol. Biol., 166, 1 (1983)], Escherichia coli K802 [J 
. Mol. Biol., 16, 118 (1966)], Escherichia coli JM105 [Gene, 38, 275 (19 

85)]^&&tfscii:#-c#3 0 

[ 0 0 2 1 ] 

$>tiav>rn*)fflU§Ch^T-&, #'Jx.fc£, A-f ;t >£ffl^3£&[Pro 

c. Natl. Acad. Sci. USA, 69, 2110 (1972)], 70^7^^^ (^§!Bg63-24 
83942) , r D# lx — >> 3 >&[Nucleic Acids Res., 16, 6127 (1988)]*$ 

[ 0 0 2 2 ] 

D N A £ tl 3 « D N A ©HSie?iJ SmSCttfflS, JSSffifll 
©&£tCf£, MfflV^;ft 3 i£SIE?'J ^Jx^v^^-^rv^tProc. Nat 
1. Acad. Sci. USA, 74, 5463 ( 1977)] * fc &373A- D N A is — Z x. (^- 



[ 0 0 2 3 ] 

±8Bfc:fcH^T&^£ih.fcDNA©EaSiE^JC£t5^ , rN M-b7 , f-f7 

hia coli JM109/pBBPJ & & £ d t A* f § § 0 
[2] *^B^©g&H©g|gi 

{iWTO^&tCfc bs ±13 [1] X t)fRff OfczMBBIiCDD N A*^fi«B 

[ 0 0 2 4 ] 

=i- Kf SSPfl-CJB&ie?!]^ 1i±©563St3«ai«:zi H>hft3=fc3^ 

#$8W©DNA£&3i-r£BiCtesi#fcJu ±ibdn Kmft&m^tz&m^'?? 

[ 0 0 2 5 ] 

^afc©**^^ £>ft£ 0 
[0 0 2 6 ] 



mmm^=K 1 2 - 2 1 1 a 4 ^ : u/ 

[ 0 0 2 7 ] 

pHelixl (Pvo. • *"-f 7V J ^ Z X&m) , pKK 
233-2 (7Yi/t A • 77;v?v7 • A*-f ^f^M) , pSE280 U>t"hns; 

^>#M) , pGEMEX-i (rn^^ti) , pQE-8 {*7fy#m ^ PET-3 (y^* 
^>&SO , PKYP10 (^HBS58-110600) , P KYP200[Agric. Biol. Chem. , 48, 
669 (1984)], pLSAl[Agric. Biol. Chem., 53, 277 (1989)], pGELl[Proc. Nat 
1. Acad. Sci., USA,. 82, 4306 (1985)], pBluescriptl I SK(+), pBluescript I 
I KS(+) (^ *?V—>#Mk) , pTrS30 [*Jl§ffiJM109/pTrS30(FERM BP-5407) ;i 
£"9 HUSH, pTrS32 [*Jj§SJM109/pTrS32(FERM BP-5408)ct t> mm.], PPAC31 (WO 
98/12343), pUC19 [Gene, 33, 103 (1985)], pSTV28(S?@^tt^), pUCU8 (^ 

mm±m) , ppai (^Bg63-23379.8) m^m^t % z t #-r § s 0 

[ 0 0 2 8 ] 

&&&©T&c!;^o tr£7'D^-^- (PtrpJ , lac^'D^E — ^— (PI 

ac) , P L 7'D^-^- p r7 - d ^_^_ n P SE :/n^-*-3|©, AJ^S^^ 

/--^^t s^-r-g) 7u^-#—^ spoi/d^- spo27d^-^ 

— , penP7D^-|^|^:}:!!) s t^ 0 i; fcP trp£ 2 o{fi?!j£ it 
tz7'u=E—$— (Ptrp_ x 2) , tacT'o^e — , lacT77'n^— , let I 

[ 0 0 2 9 ] 

U 5j? V — A fcg£-iS#J t$§ytO- # y (Shine-Dalgarno) I3?'J £ mife 
3 K>i:©P^^M^^gg^ (#<J;i&*6~18i&S) ^S|gfi bfc 7'^^ =• b*£ffl^3 



mmm^r=H 1 2 - 2 1 1 A 4 H: 12/ 

[ 0 0 3 0 ] 

JWM^t^Xte, 3L^x. u t^JSs -fe^^TJg N ^*5 L ;i/7l, 7i/Wf 

* h Uy'h^ ytixmmzm-t^U^s #J;Li±\ Escherichia coli XLl-Blue 
„ Escherichia coli XL2-Blue, Escherichia coli DHU Escherichia coli MC10 
00 N Escherichia coli KY3276, Escherichia coli W1485, Escherichia coli JM 
109, Escherichia coli HB101, Escherichia coli No.49 N Escherichia coli W3 
110 N Escherichia coli NY49 N Serratia ficaria , Serratia fonticola , Serrat 
ia liquef aciens , Serratia marce'scens . Bacillus subtil is. Bacillus me gate 
rium . Bacillus amylol iquef aciens , Brevibacterium immariophilum ATCC14068 
, Brevibacterium saccharolyticum ATCC14066, Corynebacterium ammmoniagene 
s, Corynebacterium glutamicum ATCC13032, Corynebacterium glutamicum ATCC 
14067, Corynebacterium glutamicum ATCC13869, Corynebacterium acetoacidop 
hi 1 urn ATCC 13870, Microbacterium ammoniaphilum ATCC 15354, Pseudomonas sp. 
D-0110\ Streptococcus agalactiae Type la. Streptococcus agalactiae Type 
Ib s Streptococcus agalactiae Type II I, Streptococcus pneumoniae Type 14 

[ 0 0 3 1 ] 

c. Natl. Acad. Sci., USA, 69, 2110 (1972)], 7 , Dh7 , 77hS (^HHg63-2 
483942) , x.'U£ h D#U — */ s >&[Nucleic Acids Res., 16, 6127 (1988)] 

[ 0 0 3 2 ] 

mmm^^m^mmtLxm^^m^^^ mm^^^-tLx, #yx.i±\ ye 

Pl3 (ATCC37115) , YEp24 (ATCC37051) , YCp50 (ATCC37419) , pHS19, pHS15^ 



a JlfcHB W-mi3*Z 1R 5B 
fg = H 1 2 - 2 1 1 A 4 M: 13/ 57 



ffiv^T&J;<, 0»j£fcf N PHOsro^E-*-, pgictd^-^-, gapz/o^e-^ — 

, ADH:To^-#-, gal 17 - D^E-^- n gal 107'n^-*-, h— hJ/ay* 
^'J^^F/P^-^- MFal yu^E — , CUP li/n^e — $g©:7* D 

[ 0 0 3 3 ] 

3l*eS4*£&W r *C:fca s '££x Saccharomyces cerevisi 

ae > Schizosaccharomyces pombe , Kluyveromyces lactis , . Trichosporon pullul 
ans, Schwanniomyces al.luvius , Pichia pastoris , Candida utilis e £ & If 

[ 0 0 3 4] 

•fti'bm^&CLk.ifiT'g^ Mz.it, b D^l/- >&[Methods Enzyraol 

., 194, 182 (1990)], ^7iD77^hS[Proc. Natl. Acad. Sci., USA, 81, 
4889 (1984)], 'J ^ Ai£[J. Bacteriol., 153, 163 (1983)]^£&tf-"3 

[ 0 0 3 5 ] 

iMi^iiintffli^^i^cjt mm^^f-tLx, mz.i£. pcdnai 

, pcDM8 (yi-Zi^nX b rUIK) , pAGE107 (^§§¥3-22979) n pAS3-3 ('#li¥2 
-227075) , pCDM8[Nature, 329, 840 (1987)], pcDNAI/Amp (-T > tT h D j; i > 
*fcS0 , PREP4 (Othn^i>tt!{) , pAGE103[J. Biochem, 101, 1307 (19 
87)], pAGE210, pAMo, pAMoA^I £ ffl I > £ C fctfT? £3o 
[0 0 3 6 ] 

C^^-et, mZ-&. Y *13ut>J (CMV) CDIE (immediate early) 3® 



r?=Hl 2-2 1 1A4 g . _4/ 



^6 — , u r n t><< )IX<D7D^: — , h-hi/3 7^7 7 , D ; E-^-, SR 
[ 0 0 3 7 ] 

X -yWXU r*-^«Bft_, th©iJST?$5t7;wt (Namalwa) tt^fettNam 
alwa KJM-1M, t MSJEHMJ^ b. r 77 U * =• K Uif ;i/Wm 

*fflfl&, ^^-X • ^A^^-O,«-eS)SCH0«s HBT5637 (4# UBS 63 -29 9 

[ 0 0 3 8 ] 

v-j7* • Un-7ilhlt(±, SP2/0, NSO^g, 7n • ^n-x^j&i: 
LT«YB2/0#, t h^ffli«^ LT{iHEK293(ATGC: CRL-1573), 293#, t 
h BALL-1^£, 77 U * ^ K U tf ;u»JKSMSi: LTfcfcCOS-1 

tt^lftlfcm^C ^ U ^ h n # U > a [Cytotechnol 

ogy, 3, 133 (1990)], >J >i?£;W> -j? A & (^¥2-227075) , 
H>^[Proc. Natl. Acad. Sci., USA, 84, 7413 (1987)], Virology, 52, 456 
(1973){CIBmo^^^^$)^^^i:^T- § So 
.[.0 0 3 9] 

fi**B]fl_£?s£t: LTffl^3«_rt::fd\ 0J*fcfBaculovirus Expression Vecto 
rs, A Laboratory Manual, W. H. Freeman and Company, New York (1992), ti 

h • y'u hzr-;i/X • -i > • ^-7- . j^yf v — , Molecular Biol 

ogy, A Laboratory Manual, Bio/Technology, 6, 47 (1988)^fzia$s£nfc;£i£ 

[ 0 0 4 0 ] 



&%&lZls],\Tm^t>tiZfcte?-m\'*&* — k. LTJis PVL1392, p 

VL1393, pBlueBacIII ( t & K -Y > tf h n *J jl m%&U % Z h #f § 3 

o 

[ 0 0 4 1 ] 

7<>h^777'* | j • 7,9 \s7— • zKU^ Fns/x- ^-f ;i/X (Aut 

ographa californica nuclear polyhedrosis virus) ^££3 S C h ^ So 
• SiliflSi: LTte\ Spodoptera frugiperda g>gBMjSfflflf-U Trichoplusia ni©|p 

[0 0 4 2] 

Spodoptera frugiperda g)Jj1 j££fflfi5>> UTfctSffl. Sf21 (m*^- a o *7 << ;i/X • -f 
^XXL/ y %/ 3 > •■ ^^7#— X 7 ■ 7^-5 h U — • Tra7;i/) ^ Trichoplu 
sia ni<Z>0fllMBI/jSi: LTiiHigh 5, BTI-TN-5B1-4 ( •<> tT h n ^ i >#S!) ^ - 
fc'f 3$MI&3fe©i§IISIIJl&i: U-C^ Bombyx mori N4^ftlf4Cfctf-c§5 0 

(^¥2-227075) , U^7i^^3>?fe[Proc. Natl. Acad. Sci., USA, 84, 
7413 ( 1987) ]3? £ fctf 3 „ 

[ 0 0 4 3 ] 

*IMfl££?g±«t LTffl^51^t(t ^^^-ilt, T 

7"d^e— ^ — t ltd Attains ^sbsi-c t «*©"Csti«v^-rncD*)0* 

ffl^T&£<, #ijx.fc£, # i; 77 V— ^e-tf^f £ Tj7-T J),* (CaMV) ©35SXo^E — 
[ 0 0 4 4 ] 



te^?tii>m^ZZb.ifi-Zg* fflz.&s T9un>7y-\)tH± ( Agrobacterium ) 
(^HfiS59-140885, Bg60-70080, W094/00977) N x^FD*^y 3 > 
m (#1181060-251887) , /i-r 4 £fflV^£& 
606856, ^fF^2517813) ^^l^cfctft-ts, 
[0 0 4 5 ] 

[ 0 0 4 6 ] 

■rac £5„ 

[ 0 0 4 7 ] 
[0 0 4 8] 



■-Zj = H 1 2 - 2 1 1 A 4 M: 17/ 



[ 0 0 4 9 ] 
[ 0 0 5 0 ] 

trfi7' □ ^ — * — £ ffl 3gij| ^ * * — T? J&K*g& L fcte'-fel * tg#-T 
[ 0 0 5 1 ] 

fli&*lTV>5RPMI1640igil&[J. Am. Med. Assoc., 199. 519 (1967)], EagleCDME 
MJgitt Science, 122, 501 (1952)], DMEMt£l& [Virology, 8, 396 (1959)], 199 
JgJ&[Proc. Soc. Biol. Med., 73, 1 (1950)] g= fciidft t>t%mz^ffe}%!in.m%? 

[ 0 0 5 2 ] 

«3fpH6~8, 25~40°C, 5% C O 2 #STf©MTt" 1~7B HfT o * 



^rm^r* Sl^B ¥fiU 3 # 1 £ 50 

mm^=H 1 2 - 2 1 1 A 4 M: 18/ 57 

ffl^n"rv>5TNM-FHJ§l(b(7r-^>s;I>ttS^)^ Sf-900 II SFM^iife (7^7 
• ns?— XftgO v ExCelHOO, ExCell405|>m£>JRH;M :n>>> 

— Grace's Insect Medium[Nature, 195, 788 (1962)]^£J1 1^3 C t 

[ 0 0 5 3] 

i£«i=L iI^pH6~7, 25~30°C^©^T-£l~5BP e i mdo 
[ 0 0 5 4] 

^Sifcts il^pH5~9, 20~40°C©^T'£3~60S[!gfT5o 

tg#^^^^^-cJg*u, ises^^si^t, iMi^ias^ 

[ 0 0 5 5 ] 

)W > t><Dj5m[J. Biol. Chem., 264, 17619 (1989)], □ t> £ 0^&[Proc. 



&fcUB mi& 1 3^ 1 £ 5 B 
ll§f = H 1 2 - 2 1 1 A 4 H; 19/ 57 

Natl. Acad. Sci., USA, 86, 8227 (1989), Genes Develop., 4, 1288 (1990)] 
s £fc&^§fl¥05-336963, 4$§g¥O6-823O21^£g3«0#i££ip/H"t3 d 

[ 0 0 5 6 ] 

4*§g¥2-227075tClBtt^n-rv»-S*^K:SSSl;Ts ^ t h" n ^g&7Gg?jg 
it IS ^ ^ £ ffl V \ fc it ^ Jf [|>i ^ * *!| ffl L T £ 0 Ji £ ± # £ ■& 3 d h % £ S o 
[ 0 0 5 7 ] 

[ 0 0 5 8 ] 

[Am. J. Clin. Nut r., 63, 639S (1996), Am. J. Clin. Nutr., 63, 627S (1996 
) % Bio/Technology, 9, 830 (1991 )] £ m ^XMi^^mX bTiH/?£ LfcUj^J* 

o mmvt}tp<D£m ■ wmmmtLxit, mz.&, sst&tioou* (^b§63-309 



fettiB W-Ml 3%. 1 50 
jg3IS-t=H 12-211A4 H: 20/ 57 

[ 0 0 5 9 ] 

t%m, 20 (1994), ffimt&m, 21 (1995 K Trends Biotechnol . , 15, 45 (1997)] 
[0 0 6 0] 

MMzmfofrmiz x t) nusu 7kmmwML£.ttA,m&, m^tiLte&m. ?y%- 

[0 0 6 1 ] 

i»»s^ hp ^fstti^ i^ai.^^ Ji^iu tsiiiasa 

f&^U yx^j^if^ (DEAE) --b7 r n— DIAION HPA-75 (H^fb 
fifc#§!0«Ut>>£^fcl£^;*>£&*n v h^77-f-$, S-Sepharose FF 
(Pharmacia*t8i)^l©U^>SfflV'>fcB&^ n h^77 — ^ 

[ 0 0 6 2 ] 



lli? = H 1 2- 2 1 1 A 4 M: 21/ 57 

mmizmmz ±ta t mmommmm&iz x & rasas s sit § 

-2) o 

[ 0 0 6 3 ] 

#$s w©g a ft s v ^ at ^ iz 2 ft£ n-a-ic 

& , ^»_hm ic m e s » s v m± j& (D^mnm fame* mmik &®i®.tzzk.ifi-v 

[0 0 6 4] 

d <D X o \Z L T ^tll.ISf hits #!l x. ii\ IB^U*-^ 1 T- H £ ft £ T ^ 
y i?IE?ij7> & 3^ & H£ * (f S d h § £ o 

HItSCi:^f§l)o #iJx.te\ D^e,©^^[Proc. Natl. Acad. Sci., USA, 
86, 8227 (1989), Genes Develop., 4, 1288 (1990)], 4SH¥05-336963 N 
¥06-82302U3l3i6©^^{3m Dt> ##gBJ3©*'J KSr/n^-f y Ah<D 

[ 0 0 6 5 ] 

*fcx *3§BJ©M£JC£F 1 a g^^^ K^CDiBi«&gaHfc ifiF 
#3[Proc. Natl. Acad. Sci., USA, 86, 8227 (1989), Genes Develop., 4, 12 

88 (i990)] o stc, m*v^7-^ h*smizttT&faik%m^tzT7 j 

[ 0 0 6 6 ] 



tltfiB ¥^13^ IE 55 
F^f = H 1 2- 2 1 1 A4 g; 22/ 57 



*-;i/;fr=3F'>*;i/7i?r:;i,i£) N tBocS ( t - 7^)1* ti %$<D 
<b3^/ifc$££ «fc »K ##&W©g£K&*iJg-t<5 So Advanced C 

hemTech#> ^ — • ft, Pharmaciaft, Protein Technology Instrura 

entft, Synthecell-Vegaft, PerSeptiveft, A&Stf^ffi^©^:/^ K^fi£*S£jflJ 

[3] h-^WIiflcDM . 

o 

[0 0 6 7 ] 

J§«^©«ts«a^ u-rti. Jg*i*©a«s&k i&mffi<Dt&m%), mmm^m^ 
mm, mwfo<DMmmm%d, &&&&&& wfrmvo, &m<t<Dmfe4bm&zw± 

[ 0 0 6 8] 

*^&^©£j5JU3:fc^T^£ftSi£|fcafi{a\ 37"Ctfl»^tl m 
moKDtf?? b — *^|&H££jsfcr£Cli:©T£S?5tt£li!Ma (U) 
1 mU/1-1,000 U/lT&lK $?^U<{ilO mU/l~100 O/l0lSfffl^5, 

£©^h>iu ^-febr* F&h*©T5. K!i£&££&f:f 3£ h#T'£3o £fz, 

mmmt Lxm^tzm^^mmm^pkm^ii Lxm^z>zt&x%z>o 

[ 0 0 6 9 ] 

mt&&ffita LTfc«fc^o WMfe&ffltL LXl±, rt , J**isx.^uy • XPfTis 
fr7 5.> (0iJ;U±*:M ^->S-215, B#i*fli*tS0 fc^©#-f ;*>#^®*St£jlfiJ 



23/ 

^y^;i/75> (^UfcfHiST^ >FB, B#$4flt*tSi) ££GD=»:r$ >8 
— ;k 7'-feh>^ H^^;i/&£#w?>ft N st'f&o.i~50 mi/i©MJ^-efflv^ 

[ 0 0 7 0 ] 

h-x&Gmm<D&mB.j&i*7k&m<btp. ph 5~io s $?g;L<fctpH 6~ 
i 2 , Mgc 1 2 m<Dmmi&m%ffi}x}Tz>zziftT*gz>o 

*M4S£# * £ £ J# L # 7 £ r — * ^ * *t« © S fci D i onexM © *| ^ « & 
&£f£ffl^Tfr -5 5 [Anal. Biochem., 189, 151 (1990)]b 

[ 0 0 7 1 ] 

ttTi:*«iB© «1?'J £ ^ T ifi, BJ3 fcfc £ ti 6 H»J fig j£ $ n s & © & 
[ 0 0 7 2 ] 

[ 1 ] 7M/7h3y*^ • 7^7^^i Type I at* A 1 £>©3*J3I^*g££-i$3tl£ 
^CD^ D— - >^ ( 1 ) 

^M/T'hny*^ • T-fi^&f-x. Type Ia*fc£J. Bacterid . , 181, 5176 ( 
1999)I3@CD^^CJ; bigHLfco jg^SBtJc J: »3 ffii*:^^ Ufc^ #b>h 
• 7D hn-;i/X • > • ^L/^ra.^— • A-f # D — £ 13 ifc © # i£ £ fi£ o Ts 

[ 0 0 7 3 ] 



F^=H 12-211A4 M: 24/ 57 



^hl/7F^yjj7-7iJ7^fx Type III©*JK^*&£^fifcfcf^^& 3c 
P_sD3tfc^JI]?2©i&SiB?!J[Mol. Microbiol., 8, 843 (1993) ](cS^^TlSSt £ 

fe y h t. LT/BW -XYVzryzi vtiTs - T13^^^^- Type Ia^©|£ 
MDNA^iitbtPCR^fTofco PCRIiMDNA 0.1/*g, ^7 
-i — & 0.5/zmol/l, Takara LA Taq# U 7— t* (S?@?g#:§^)2.5 units, Tak 
ara LA Taq# U p< ^ — -{f ffiiUfnt 4/zK deoxyNTP^g- 200^mol/l £^<t?S*Sig 4 
0^l£fflV\ 94 o C-Cl^•^^42 o C'r27>P B ^72 o C-e3^P^CDXa^30IIl^^)?g■rc h 

[0 0 7 4] 

^M/^hay*^ • 7#^*>:n Type Ia»©&fefl;D N A&ftJKSP&EcoR 
I£ «fc D4&abfc»r#&pBluescriptII SK( + )£Jg]|g LTl«i^D N A£ft§^ U 

, K«aife^.^DNA*ffl^T^:jBffijMio9tts^Hteife-r-5^^{3«fc t) ^-r x^ 

m*,* >&ft\\ &\,^>?i-)l%;^??u — > H*£?mLfco 
iOMttSr?^? h*£pBA101i:fft;g U *©18j££fi¥#fLfcfcc:^ pB 
luescriptll SK( + )&C* h b 7 b zi y ti X • 7 # v ^7 f-x. Type Ia^©^-g#D 
NAfi*® 3.5kb©ifM-^^A^nfe^iaT-afeofc 0 «t D N A BNt© 

^IS?iJ£:#i£LfchC:3, 8&EDNAWr#-£& N J. Bacterid., 181, 5176 (19 
99)T'cpsIaF, cpsIaG :fe <fc£>" cpsIaH ^ifo£, 3 o©3fifs^ ^cpsIaEii^S^ 

S^t*DNAta&5C:i:#18g$nfc. fflRH4^H?©<g^, cpsIaE jt 

-f5^{i^;i/n-X!K^If^3t1S?s cpsIaG aHg^a/? 1 . 4 - X? ? h 

[2] • 7 is*? Type Ia»> £ ©^M#M££-/}£3Ifc 

-?©^7 d — - >^ ( 2 ) 



r-? = H 12-21 1A4 ^ • 25/ 



1© [1] tmbtltzT'^XK r*pBA101£#A«*ftT^£;* h h n 
vjj?, • TX^&f-x. Type Ia#cE&#© 3 . 5kb©8ff it £ 5 > A 7* 7 -f ^— DNA 

Hit L fco 

[ 0 0 7 5 ] 

^H/rhnyA^ • TlS'yZ?-*. Type IatfcO&fiffcD N A Sfclfiggp^Bgll 
KC £ &M3ILfc»r#£pBluescriptII SK(+)tMlTll^iftD N A^Ml 

x K«a^^#DNA*ffiv^T^lli®JMlO9t*s03Ste^^l■rsci:^c«t t) 

ar^** F£pBA103fctft£U *©#3£&#P#rLfc£c:^ pBluescriptll SK 
( + h yfc;* • 7ji=,>7 3-*. Type Iat©^#;D N Aft*® 3.1k 

bODNABf^tfA^tlfcfll^f&^fci S^fCcfcbl^ 3.1kb©DN A»fjt© 
^IB?iJS^^Lfei:C^ N gEDNAWfJttfi, cpslak cfisIaJi:^^ L fcitfc 
^^cpsIaHitfE^CD-g^^ttj^^^ pBA103(C^^tl-SDNAif>t«> pBAiOl 
£ U * $ ft S D N A m h # ± T' L X V ^ * D N A T? & £ d £ & N 

pBA103^^fc^M^M^^3liE^©-§|5©DNA^«*-r-?>r7^ ^ Ff$> 

s^iitfipjfflLfco mmmkmcDmm, cEsiaijte^(±/? 1 , 3-n-t 

»fc^fc^ffil^£^rcfc#fllS£ftfc[iai, J. Bacterid. , 181, 5176 
(1999)]. 

[3] 7, h • TiS^^^-x. Type I btt 1 , 3 —8*7? h 

1 CD [2] t?^e>tlfe7 , 7^ ^ KpBA103(C^A£ftT^£* h U 7* h 3 
• T'tf^^x. Type Ia^ft^© 3. lkb©®r>t >f -f x 7 — DNA 

[ 0 0 7 6 ] 



« XT 1 O O 1 1 A yl ftttlB 3^ 1^ 5B 

F-t = H 1 2 - 2 1 1 A 4 M: 26/ 57 



* h 1/7* h n • Type Ib*fc£ s J. Bactriol . , 181, 5176 

(1999)IBfg©^^(Ccfc "9±&#Lfco jS'O^^C J; Dl^Wfbfct, # L- > h 

XFl/7f3^7-7i37^?x Type Ib#fc©|££.#D N A £$iJPSgf IftBsII 
IfCcfc «SUfclfrfr£pBluescriptII SK( + )£#AU Ztl^AmMMlOWzm 

[ 0 0 7 7 ] 

±t^m$i ufe7*D— yjs^v^^ 7* -5- i J-^ut3D--^^yij y-r -te" 

;f 37'^;* 5. K£^ft^*ipBB102£ckt>*pBB103hi$£ U ^©^^^^/fUfe 
^^<5> -^n^tlpBluescriptir SK(+)t^ hb^hzay^ • Ty 
pe Ib^O^fe^D N AS^CO 5.5;fccfct>* 1 .4kbcD0§rJt>WA;* ftfc#I^T*& o .• 
(121 1 ) o 

[ 0 0 7 8 ] 

z.nt> 2 -OWN Amfriz^i^-c, mmzi£\^Mmm*&7£.i,fztz?> s m 

2 -D(D D N Airtf-iilfet^D N A±T"l^g UT#4bTiS t) ^ IE?"J#-!§ 3 X«m$ 
*IB?!jfr£>&3 DN A Kite, cpsIbE , cpsIbF , cpsIbG , cpsIbH , cpsIbK &<fc^ 

cp^ibj^^^^n-&3tfE^^#s-r^«i h^^jBjbfec *Bi^£tfc*t©ijg^ cpsi 

W&fc^&c£sla2MfcT&£V<7J\sZ]—xU&MmMfc^, cpsIbG itfc^&cpsI 
aGjifc^ 1 , 4 -#7^7 h-^$E^^^itf5^s cpslbl jitfc^& cpslal 

MB^^JiXf/Q 1 , 3 - N-^^^;i/*^^ Mt^ ym&Bmmte? tmmVttt 
mfr-DfzZtti^, fiB?iJ1l^3T"S£n3iMia?<Jfr £>&£>DNA&, 
b^vtiX ■ T-n^Z^x- Type Ibtt©3»5M#M^^e^g¥©-g|5^^^t- 
3 D N AT & £ £ #flgg£ nfco 
[ 0 0 7 9 ] 



w-mi 3^ i % 5 h 

mmm^r = K 1 2 - 2 1 1 A 4 M: 27/ 57 

^hU7*hrJ«y*^ -TiS^^^x. Type im^^Sit i^JSL^Wt&^^^X 
CEsIaJite^ii^ti^^^ ^6Dte^§fioTV>Sd^[J. Bactriol., 181, 
5176 (1999)], h U 7" h n >y*X • TU???-*- Type IaJfc^T; h 1/7' 

hziyfix • TfS^^^x. Type IbflrCtt&#7 * b—Z *tl*tl 
J8 1 , 4b./3 1, 3hi^ot^5Ci:[J. Bactriol., 181, 5176 (1999)]fr£ 

, cpsibJ itfe^(±/? i . 3 -nz>>? h-xm&mmfe<&&%tz>m&M*-3- k 
mmm 2.01, 3 -#7* b-Mmmmm^mMmco^m 

Ji— ter^-r ^ • ^*-<^-^^^AXtt^8905^DN A^#^fflV^T^•^!KUfe 
x iH^J#-^-6i5ctt>'7-e*^nSitSi3^J*^^€)DNA^ffl^-C, HlEfi»J 1 © 

[3] xm%zntzii^>7 Y—T.nmmmm^^tsD^ A®rjt£, tis©^ • 

[ 0 0 8 0] 

Type IbtOMftDNASiSiiUtPCRStT^to PCRIiMDNA 
O.l^g, 7"7^y-^ 0.5^mol/K Takara LA Taqtf U ^ ^ — Hf (Sigjg*tS02 
.5 units, Takara LA Taqtf U * ^ — -If JMtfnt deoxyNTP# 200/^mol/l 

£"£trRf&iS 40Atl&fflW 94°CT"l^P^^42 o C-C2^P^,72°C-r-3^P^©xa^30 

[ 0 0 8 1 ] 

I£S Jfoflfc© 1/ 10S £ 7- # D — * y ; VflM^I/j L , @ 6<J © if M" # Jf U T V ^ S Z 

t&mmik, m*) co fcfcffifr v—>u*y M/-w;t 101*110 

■TJffi»rJt£II!JKU f£DNACDTE[10 mmol/1 Tris-HCh 1 mmol/1 EDTA (pH 

8.o)] mmm 20ui*mtz o 

2.0 kb©DNA»r>T-SIIlJRUfco 
[ 0 0 8 2 ] 



filtHH ¥fiU 1 J§ 

: -?=H 1 2 - 2 1 1 A 4 M: 28/ 



pBluescriptll SK( + ) DNA 0.2//gS»JBB»^NotIi5 molXWm^ T 

t) 3.0 kb©DN A©fM-^l£]lKLfeo 
IS 2.0 kb*s«tV 3.0 kb©tBrJt£7^— *>3>*y h£fflv^Ts 16°CT-16B# 

[ 0 0 8 3 ] 

n ifIteM^7>^y'J> 50//g/ml£^&LBS5^1ft[M^ h hiJ7 h> 
(r^^tti) 10 g/U (foaafi) 10 g/U ^jbt h 'J ->A 

5 g/U £?U5g/l] 37°CT--TO«Lfeo 

*8fb£ «fc tJffllKLfc (1212 ) o , . ^ 

[ 0 0 8 4] 

^-tT'f'f :/ • jarisXr AXttM8905MD N A-£-fi£^£ffll^T 
L *^ IB#J#^- 7 ;fc«fc U* 8 ^£ ft 3 ^SIB^'Jfr f.ft5DNA§7"7^ y- — 
■fe y h h LTffll^ ^ h 1/ h 3 y • TiS^Z^-x- Type I b^GD D N 

Tit iH»f^*[llJ|RU iDNACTEjW 20/zl&*#fc o 
[ 0 0 8 5 ] 

5//l£flHv DNA£ftJP»m Ecolfc £0? XhoI-CtOWf L, 7#D 

t) 1.0 kbcDDNArr^SEIiRbfco 
pBluescriptll SK( + ) DNA 0.2/*g£$ij|ggfcH EcoRI:fc«fctf XholTf-EOBfgU 

ffl^T 3.0 kbGDDNABrfr&EUKUfco 
[0 0 8 6 ] 



^Si#-^ = H 1 2 - 2 1 1 A4 M: 29/ 

f£ 1.0 kb&J^tf 3.0 kb^SfM-S^'f -y— i/a b£fflV^T, 16°C-C16B3 

[0 0 8 7 ] 

^fbtct bfiisufc (02), 

^hl/T'hrJry*^ • Tfi^P^x. type Ib&*®/? 1 , 3 - US?*? h— ^$5 
iii?SttSttS^fiIS3-h"tSDNA§Mt57 , 7X^ FpBBPJ£{£ 
;£-f 3*fl§fi Escherihia coli JM109/pBBPJ(i, 1-2^12^21 Bf*tt"C 
v HftiKi^lfHSii^. (H*i«o < fc£rfj Jg 1TB IS 3-^; 
) tFERM BP- 7 4 0 0tltfflt^nt^S o 

CTff'J 2 T^f^n /^ Escherichia coli JM109/pBBPIJtt*5«tZJ c JM109/pBBPJ*** 
*n*n7>^>'J> 50#g/ml£"£irLBigilb 8 mlOAo fc*SK®@fc:SES 
t37°CT-17B#F B ^#Ufco &t&mm&7'>\fisV > 50#g/nil£^tf L Btgife 8 
ml OAofc*Sa»f{:l%il L37 0 Cr5B#|g|JgSI& % 1 mmol/1 4: & § «t a fc I 

SiSttfr i£[J. Biol. Chem;, 272. 19502 (1997), Mol. Microbiol., 

26, 197 (1997)]KftoTJBSEi#£il8!iLfco fgMlJ^&^K jfc b"t-80°C-e 

[0 0 8 8] 

g^«IHh&€>^ h — N — h U^-^IIii, 7^7 h-N-^^r h7*- 

7c*ig0^7^ 100 o CT"5#F B 1M3I-f S^lhfr J: u 

G--n^>7 H*i£t4£ SCI fc£ J: bUSKLfco 

JM109/pBBPIJt*J&Hifl'(200,ag/ml), 50 mmol/1 ^7 ;n> y y t- (pH 7.0) 



= ¥^13^ IE 5 
ll#f = H 1 2 - 2 1 1 A 4 M: 30/ 

, 5 mmol/1 MnC 1, N 10 mmol/1 ^ b-N- h U^ — 5 mmol/1 UD 

P h-J2fr£>&S 0.1 mlCDSjSiS*-^ 37°CT-72B$P E gS£S£fi : :ofco 

[ 0 0 8 9] 

s fcfo j txtt mm^n mm ( dx-5 oo ) & % v ^ t w 

T^f/r^tr^flr U> ^ffrrW^ 0.2 mmol/l<£>^ h -N-^ b^-** 5 
(BI^i{CJM109/pBBPJ^©Mli^^ffiV^fe^-^t3{±, 0.05 mmol/l<D^ b-N- 

ts -5 A : CarboPAC PA10 
?MfM : A ; H 2 0, B ; 500 mmol/1 NaOH 
V^ViLy h : 10-70% B in 20 min 

[ 0 0 9 0] 

[ 0 0 9 1 ] 
[IB3«:7 b] 

ia^J#-^4 : &ffi-D NA 
mm^5 : »NA 

IB^'JS-^ 6 : ^DNA 
IE^iJ#-^ 7 : ^DNA 
IE?iJ#-s§8 : »NA 



IR 50 
: 31/ 57 



= H 1 2 - 2 1 1 A 4 



[ 0 0 9 2 ] 

[Emm 

SEQUENCE LISTING 



<110> KYOWA HAKKO KOGYO CO., LTD. 



<120> Beta 1,3-galactosyltransferase and a DNA coding for said enzyme 
<130> H12-2111A4 



<140> 
<141> 

<160> 8 

<170> Patentln Ver. 2.0 



<210> 1 
<211> 

<212> PRT 

<213> Streptococcus agalactiae Type lb 



<400> 1 

Met Asn Tyr Ser He He Met Ser Val Tyr Asn Glu Pro Leu Asn Tyr 
15 10 15 



Val Arg Asp Ser Val Glu Ser lie Leu Asn Gin Thr Leu Thr Asp Phe 
20 25 30 



■■^ = U 1 2 — 2 1 1 A 4 



H: 32/ 57 



Glu Phe He He Val He Asp Asn Pro Ser Arg Gly Asp Leu Lys Gin 
35 40 45 

Phe Leu Thr Glu Tyr Ser Val Val Asp Asn Arg He Lys He Leu Leu 
50 55 60 

Asn Glu Glu Asn He Gly Leu Ala Ser Ser Leu Asn Lys Ala Val Lys 
65 70 75 80 

He Ser Lys Gly Glu Tyr He Phe Arg Met Asp Ala Asp Asp He Ser 
85 90 95 

Tyr Pro Ser Arg Phe Asp Lys Gin He Arg Phe Met Glu Glu Asn Ser 
100 105 110 

Leu Asp Phe Ser Ala Thr Leu He Glu Leu He Asp Gin Lys Gly Asn 
115 120 125 

Leu Val Tyr Lys Gin Arg Glu Ser Asn Lys He Tyr Leu Thr Asn Asp 
130 135 140 

He Arg Lys Met Leu Leu Asn Arg Ser He Leu Ala His Pro Thr Trp 
145 150 155 160 



Cys Val Lys Lys Lys Val Phe Asp Lys Leu Met Gly Tyr Arg Asp Leu 
165 170 175 



Val Pro Val Glu Asp Tyr Asp Phe Ala He Arg Gly Ala Leu Ala Asp 
180 185 190 



: -t = H 1 2 - 2 1 1 A 4 



iitHB ¥fiSl3# 1J§ 5B 
H: 33/ 57 



Phe Lys He Gly Leu Leu Asn Lys Val Leu Leu Gin Tyr Arg Leu Asn 
195 200 205 

Glu Asn Gly He Ser Gin Thr Asn Lys Phe Lys Gin Tyr He Tyr Ser 
210 215 220 

Ala He Leu Gin Asp Phe Tyr Lys Glu Lys Ser Tyr He Asp He Thr 
225 230 235 240 

Lys He Thr Asn Tyr Phe Gin Glu Tyr Val He Lys Lys Arg Tyr Thr 
245 250 255 

Gin Gin Glu Leu Ser Lys Tyr Phe Glu Leu Lys Ser Thr Pro Ser lie 
260 265 270 

Thr lie Arg Lys Leu Tyr He Cys Leu Tyr Leu Tyr Phe Lys Ser Pro 
275 280 285 

Leu Val Arg Arg Leu Leu He Asn Asp He Asn He Leu Val Leu Lys 
290 295 300 

Leu Phe Gly Gly Glu Lys Gin Ser Asp 
305 310 313 



<210> 2 
<211> 

<212> DNA 

<213> Streptococcus agalactiae Type lb 



=H 1 2 - 2 1 1 A 4 



tltUB ¥/?U 3^ 1J§ 5B 
H : 34/ 57 



<400> 2 

atg aat tat agt ate att atg teg gta tat aat gag cct tta aat tat 48 

Met Asn Tyr Ser He He Met Ser Val Tyr Asn Glu Pro Leu Asn Tyr 
15 10 15 

gtg aga gat tea gta gaa tct ata tta aat caa acg ctt act gat ttt 96 
Val Arg Asp Ser Val Glu Ser He Leu Asn Gin Thr. Leu Thr Asp Phe 
20 25 30 

gag ttc ata att gtc att gat aat cca agt aga ggt gat tta aag caa 144 
Glu Phe He He Val He Asp Asn Pro Ser Arg Gly Asp Leu Lys Gin 
35 40 45 

ttc tta aca gaa tat tcagtt gta gat aat aga ata aaa ate ttg ctt 192 
Phe Leu Thr Glu Tyr Ser Val Val Asp Asn Arg He Lys He Leu Leu 
50 55 60 

aat gaa gaa aat att ggt tta gca tea agt ttg aac aaa gcg gtg aaa 240 
Asn Glu Glu Asn He Gly Leu Ala Ser Ser Leu Asn Lys Ala Val Lys 
65 70 75 80 

att tct aag gga gaa tat att ttt aga atg gat get gat gat att tea 288 
He Ser Lys Gly Glu Tyr He Phe Arg Met Asp Ala Asp Asp He Ser 
85 90 95 

tat cca agt aga ttt gat aag caa att cgt ttt atg gag gaa aat tea 336 
Tyr Pro Ser Arg Phe Asp Lys Gin He Arg Phe Met Glu Glu Asn Ser 
100 105 110 



F-? = H 1 2 - 2 1 1 A 4 



titfciB 1 3^ 



1£ 5 H 
: 35/ 57 



ttg gat ttc tea gca act eta ata gaa ttg ata gac caa aaa gga aat 384 
Leu Asp Phe Ser Ala Thr Leu He Glu Leu He Asp Gin Lys Gly Asn 
115 120 125 



tta gta tat aaa caa cga gaa agt aat aaa ata tac tta act aat gat 432 
Leu Val Tyr Lys Gin Arg Glu Ser Asn Lys He Tyr Leu Thr Asn Asp 
130 135 140 



ata egg aag atg tta ttg aat aga tct 
He Arg Lys Met Leu Leu Asn Arg Ser 
145 150 

tgc gta aaa aag aaa gtt ttc gat aag 
Cys Val Lys Lys Lys Val Phe Asp Lys 
165 

gta cct gtt gaa gat tat gat ttt gca 
Val Pro Val Glu Asp Tyr Asp Phe Ala 
180 185 



ata ctt gee cac cca acg tgg 480 
lie Leu Ala His Pro Thr Trp 
155 160 

tta atg gga tat aga gat tta 528 
Leu Met Gly Tyr Arg Asp Leu 
170 175 

ata aga gga get ctg get gat 576 
He Arg Gly Ala Leu Ala Asp 
190 



ttc aaa ate ggc tta etc aat aaa gta ctt tta cag tat aga tta aac 624 
Phe Lys He Gly Leu Leu Asn Lys Val Leu Leu Gin Tyr Arg Leu Asn 
195 200 205 



gag aat gga ata tea caa ace aat aag ttt aag caa tat att tac tea 
Glu Asn Gly He Ser Gin Thr Asn Lys Phe Lys Gin Tyr He Tyr Ser 
210 215 220 



672 



r^=H 1 2 - 2 1 1 A 4 



mtUB 1 3^ 1 £ 5 B 
M: 36/ 57 



get att tta caa gat ttt tat aaa gaa aaa tct tat att gat ate aca 
Ala He Leu Gin Asp Phe Tyr Lys Glu Lys Ser Tyr He Asp He Thr 
225 230 235 240 



720 



aaa att act aat tac ttt caa gag tat gtg ata aag aaa cgc tat act 
Lys lie Thr Asn Tyr Phe Gin Glu Tyr Val lie Lys Lys Arg Tyr Thr 
245 250 255 



768 



cag caa gag etc tct aaa tat ttt gag eta aaa tct acc cct agt att 
Gin Gin Glu Leu Ser Lys Tyr Phe Glu Leu Lys Ser Thr Pro Ser He 
260 265 270 



816 



act att aga aaa eta tat att tgt tta tat tta tac ttt aag tct ccc 
Thr He Arg Lys Leu Tyr He Cys Leu Tyr Leu Tyr Phe Lys Ser Pro 
275 280 285 



864 



ttg gtt agg agg tta tta ata aat gat att aat att tta gta ctg aaa 
Leu Val Arg Arg Leu Leu He Asn Asp He Asn He Leu Val Leu Lys 
290 295 300 



912 



ttg ttt gga gga gag aaa caa agt gac 
Leu Phe Gly Gly Glu Lys Gin Ser Asp 
305 310 



939 



<210> 3 
<211> 6865 
<212> DNA 

<213> Streptococcus agalactiae Type lb 



najB ¥m 1 3^ 1^ 5b 

M : 37/ 57 



g|3j#-if=H 1 2 - 2 1 1 A 4 



<220> 
<221> CDS 

<222> (617). .(1789) 



<220> 
<221> CDS 

<222> (1816).. (2262) 



<220> 
<221> CDS 

<222> (2265).. (2744) 

<220> 
<221> CDS 

<222> (2843).. (3979) 



<220> 
<221> CDS 

<222> (3982).. (4953) 



<220> 
<221> CDS 

<222> (5009).. (5947) 



<400> 3 

agatcttgga gatattatct gtgaaaccaa tgttcctaga ctgatggtcg ttccttcagg 60 
gaaagtacca ccaaatccaa cagcattact tcagaacgct tattttaata agatgattga 120 



HfcHB 1 3# 1£ 50 
^ = H 1 2- 2 1 1 A4 H; 38/ 57 



agctattaaa aatatatttg attatattat catcgatact ccacctattg gtttagttgt 180 

tgatgccgca ataatcgcta atgcttgcga tggttttatt ttagtaaccc aagcaggtag 240 

aataaaacgt aattatgttg aaaaagcaaa agaacagatg gaacaaagtg gttcaaagtt 300 

cttaggtatt attcttaata aagttaatga atctgttgct acttacggcg attatggaaa 360 

ttacggaaaa agggatagaa aaaggaagta aggggctctt gtattgaaag aaaaagaaaa 420 

tatacaaaag attattatag cgatgattca aaccgttgtg gtttattttt ctgcaagttt 480 

gacattaaca ttaattactc ccaactttaa aagcaataaa gatttattgt ttgttctatt 540 

gatacattat attgtctttt atctttctga tttttacaga gacttttgga gtcgtggcta 600 

tcttgaagag tttaaa atg gta ttg aaa tac age ttt tac tat att ttc ata 652 

Met Val Leu Lys Tyr Ser Phe Tyr Tyr He Phe He 
1 5 10 

tea agt tea tta ttt ttt att tct aaa aac tct ttt aca acg aca cga 700 
Ser Ser Ser Leu Phe Phe lie Ser Lys Asn Ser Phe Thr Thr Thr Arg 
15 20 25 

ctt tec ttt ttt act ttt att get atg aat teg att tta tta tat eta 748 
Leu Ser Phe Phe Thr Phe He Ala Met Asn Ser He Leu Leu Tyr Leu 
30 35 40 



ti=Hl 2 - 2 1 1 A 4 



mmH 1 3if 1 £ 5 B 
H: 39/ 57 



ttg aat tea ttt tta aaa tat tat cga aaa tat tct tac get aag ttt 
Leu Asn Ser Phe Leu Lys Tyr Tyr Arg Lys Tyr Ser Tyr Ala Lys Phe 
45 50 55 60 



796 



tea cga gat ace aaa gtt gtt ttg ata acg aat aag gat tct tta tea 
Ser Arg Asp Thr Lys Val Val Leu He Thr Asn Lys Asp Ser Leu Ser 
65 70 .75 



844 



aaa atg acc ttt agg aat aaa tac gac cat aat tat ate get gtc tgt 892 
Lys Met Thr Phe Arg Asn Lys Tyr Asp His Asn Tyr He Ala Val Cys 
80 85 90 



ate ttg gat tec tct gaa aag gat tgt tat gat ttg aaa cat aac teg 940 

He Leu Asp Ser Ser Glu Lys Asp Cys Tyr Asp Leu Lys His Asn Ser 
95 100 105 

tta agg ata ata aac aaa gat get ctt act tea gag tta acc tgc tta 988 

Leu Arg He He Asn Lys Asp Ala Leu Thr Ser Glu Leu Thr Cys Leu 
110 115 120 



act gtt gat caa get ttt att aac ata ccc att gaa tta ttt ggt aaa 1036 
Thr Val Asp Gin Ala Phe He Asn He Pro He Glu Leu Phe Gly Lys 
125 130 135 140 



tac caa ata caa gat att att aat gac 
Tyr Gin He Gin Asp He lie Asn Asp 
145 



att gaa gca atg gga gtg att 1084 
lie Glu Ala Met Gly Val He 
150 155 



gtc aat gtt aat gta gag gca ctt age ttt gat aat ata gga gaa aag 



1132 



: -? = H 1 2 - 2 1 1 A 4 



attm w-m 1 3^ 



13 5B 
: 40/ 57 



Val Asn Val Asn Val Glu Ala Leu Ser Phe Asp Asn He Gly Glu Lys 
160 165 170 

cga ate caa act ttt gaa gga tat agt gtt att aca tat tct atg aaa 1180 
Arg He Gin Thr Phe Glu Gly Tyr Ser Val He Thr Tyr Ser Met Lys 
175 180 185 

ttc tat aaa tat agt cac ctt ata gca aaa cga ttt ttg gat ate atg 1228 
Phe Tyr Lys Tyr Ser His Leu He Ala Lys Arg Phe Leu Asp He Met 
190 195 200 

ggt get att ata ggt ttg etc ata tgt ggc att gtg gca att ttt eta 1276 
Gly Ala He He Gly Leu Leu He Cys Gly He Val Ala He Phe Leu 
205 210 215 220 

gtt ccg caa ate aga aaa gat ggt gga ccg get ate ttt tct caa aat 1324 
Val Pro Gin He Arg Lys Asp Gly Gly Pro Ala He Phe Ser Gin Asn 
225 230 235 

aga gta ggt cgt aat ggt agg att ttt aga ttc tat aaa ttc aga tea 1372 
Arg Val Gly Arg Asn Gly Arg He Phe Arg Phe Tyr Lys Phe Arg Ser 
240 245 250 



atg cga gta gat gca gaa caa att aag aaa gat tta tta gtt cac aat 1420 
Met Arg Val Asp Ala Glu Gin He Lys Lys Asp Leu Leu Val His Asn 
255 260 265 



caa atg acg ggg eta atg ttt aag tta gac gat gat cct aga att act 
Gin Met Thr Gly Leu Met Phe Lys Leu Asp Asp Asp Pro Arg He Thr 



1468 



F-^ = H 12-211A4 g : 41/ 57 



270 275 280 

aaa ata gga aaa ttt att cga aaa aca age ata gat gag ttg cct caa 1516 
Lys He Gly Lys Phe He Arg Lys Thr Ser He Asp Glu Leu Pro Gin 
285 290 295 300 

ttc tat aat gtt tta aaa ggt gat atg agt tta gta gga aca cgc cct 1564 
Phe Tyr Asn Val Leu Lys Gly Asp Met Ser Leu Val Gly Thr Arg Pro 
305 310 315 

ccc aca gtt gat gaa tat gaa aag tat aat tea acg cag aag cga cgc 1612 
Pro Thr Val Asp Glu Tyr Glu Lys Tyr Asn Ser Thr. Gin Lys Arg Arg 
320 325 330 

ctt agt ttt aag cca gga ate act ggt ttg tgg caa ata tct ggt aga 1660 
Leu Ser Phe Lys Pro Gly He Thr Gly Leu Trp Gin He Ser Gly Arg 
335 340 345 

aat aat att act gat ttt gat gaa ate gta aag tta gat gtt caa tat 1708 
Asn Asn He Thr Asp Phe Asp Glu He Val Lys Leu Asp Val Gin Tyr 
350 355 360 



ate aat gaa tgg tct att tgg tea gat att aag att att etc eta acg 
He Asn Glu Trp Ser He Trp Ser Asp He Lys He He Leu Leu Thr 
365 370 375 380 



1756 



eta aag gta gtt tta etc ggg aca gga get aag taaaggtaag gtttgaaagg 1809 
Leu Lys Val Val Leu Leu Gly Thr Gly Ala Lys 
385 390 



>-?=H 1 2 - 2 1 1 A 4 



jftBB W-m 13^ 1 M 5 B 
H: 42/ 57 



aatata atg aaa att tgt ctg gtt ggt tea agt ggt ggt cac eta gca 
Met Lys He Cys Leu Val Gly Ser Ser Gly Gly His Leu Ala 
395 400 405 



1857 



cac ttg aac ctt ttg aaa ccc att tgg gaa aaa gaa gat agg ttt tgg 1905 
His Leu Asn Leu Leu Lys Pro He Trp Glu Lys Glu Asp Arg Phe Trp 
410 415 420 

gta act ttt gat aaa gaa gat get agg agt att eta aga gaa gag att 1953 
Val Thr Phe Asp Lys Glu Asp Ala Arg Ser lie Leu Arg Glu Glu He 
425 430 435 

gta tat cat tgc ttc ttt cca aca aac cgt aat gtc aaa aac ttg gta 2001 
Val Tyr His Cys Phe Phe Pro Thr Asn Arg Asn Val Lys Asn Leu Val 
440 445 450 

aaa aat act att eta get ttt aag gtc ctt aga aaa gaa aga cca gat 2049 
Lys Asn Thr He Leu Ala Phe Lys Val Leu Arg Lys Glu Arg Pro Asp 
455 460 465 

gtt ate ata tea tct ggt gee get gta gca gta cca ttc ttt tat att 2097 
Val He He Ser Ser Gly Ala Ala Val Ala Val Pro Phe Phe Tyr He 
470 475 480 485 



ggt aag tta ttt ggc tgt aag ace gtt tat ata gag gtt ttc gac agg 
Gly Lys Leu Phe Gly Cys Lys Thr Val Tyr He Glu Val Phe Asp Arg 
490 495 500 



2145 



: -? = H 1 2 - 2 1 1 A4 



tlfcUB 3^ IE 5 
M: 43/ 



ata gat aaa cca act ttg aca gga aaa tta gtg tat cct gta aca gat 
He Asp Lys Pro Thr Leu Thr Gly Lys Leu Val Tyr Pro Val Thr Asp 
505 510 515 



2193 



aaa ttt att gtt cag tgg gaa gaa atg aaa aaa gtt tat cct aag gca 2241 
Lys Phe He Val Gin Trp Glu Glu Met Lys Lys Val Tyr Pro Lys Ala 
520 525 530 

att aat tta gga gga att ttt ta atg att ttt gtc aca gta ggg aca 2288 
He Asn Leu Gly Gly He Phe Met He Phe Val Thr Val Gly Thr 
535 540 545 



cat gaa cag cag ttc aac cgt ctt att aaa gaa gtt gat aga tta aaa 2336 •=?-. 
His Glu Gin Gin Phe Asn Arg Leu He Lys Glu Val Asp Arg Leu Lys 
550 555 560 

ggg aca ggt get att gat caa gaa gtg ttc att caa acg ggt tac tea 2384 
Gly Thr Gly Ala He Asp Gin Glu Val Phe He Gin Thr Gly Tyr Ser 
565 570 575 580 

gac ttt gaa cct cag aat tgt cag tgg tea aaa ttt etc tea tat gat 2432 
Asp Phe Glu Pro Gin Asn Cys Gin Trp Ser Lys Phe Leu Ser Tyr Asp 
585 590 595 



gat atg aac tct tac atg aaa gaa get gag att gtt ate aca cac ggc 2480 
Asp Met Asn Ser Tyr Met Lys Glu Ala Glu He Val He Thr His Gly 
600 605 610 



ggt cca gca acg ttt atg aat gca gtt tct aaa ggg aaa aaa act att 



2528 



m&n 3^ ir 5h 

ili^ = H 1 2- 2 1 1 A4 H: 44/ 57 

Gly Pro Ala Thr Phe Met Asn Ala Val Ser Lys Gly Lys Lys Thr He 
615 620 625 

gtg gtt cct aga caa gaa cag ttt gga gag cat gtg aat aat cat cag 2576 
Val Val Pro Arg Gin Glu Gin Phe Gly Glu His Val Asn Asn His Gin 
630 635 640 

gtg gat ttt ttg aaa gag tta ttc ttg aaa tat gag tta gat tat att '2624 
Val Asp Phe Leu Lys Glu Leu Phe Leu Lys Tyr Glu Leu Asp Tyr He 
645 650 655 660 

ttg aat ate agt gaa tta gag aat att att aag gaa aaa aat ata tct 2672 
Leu Asn He Ser Glu Leu Glu Asn He He Lys Glu Lys Asn He Ser ^ 
665 670 675 

act agt aaa gta ata tea caa aac aat gat ttt tgt tec tct ttc aaa 2720 
Thr Ser Lys Val lie Ser Gin Asn Asn Asp Phe Cys Ser Ser Phe Lys 
680 685 690 

aat gaa ctt tct aaa eta ttt gaa taaatatatt ttgttggaga aaaaaattga 2774 
Asn Glu Leu Ser Lys Leu Phe Glu 
695 700 

aattaactat caatccaaag tatttgttaa taggaggaat tttegcttta accctatttt 2834 



caaageca atg caa ctt ttg tta ctt tta gca tta ata gtt tta ctt att 2884 
Met Gin Leu Leu Leu Leu Leu Ala Leu He Val Leu Leu He 
705 710 



jg-3 = H 1 2 - 2 1 1 A 4 



H: 45/ 57 



tgt agt agt tat aat gaa aaa atg aaa ttt tta aat atg get gaa att 
Cys Ser Ser Tyr Asn Glu Lys Met Lys Phe Leu Asn Met Ala Glu He 
715 720 725 730 



2932 



ttt ttc att gta ttt tat atg gtt tat tta gta tea ata gta tta aat 
Phe Phe He Val Phe Tyr Met Val Tyr Leu Val Ser He Val Leu Asn 
735 740 745 



2980 



teg tta ttt aga agt cca gaa ttt cat aga gtc att get gca ttc aat 
Ser Leu Phe Arg Ser Pro Glu Phe His Arg Val He Ala Ala Phe Asn 
750 755 760 



3028 



tea ctg gca gta ggg gtt gtg tec tta tta ttt tac cat tac tat aag. 
Ser Leu Ala Val Gly Val Val Ser Leu Leu Phe Tyr His Tyr Tyr Lys 
765 770 775 



30 IS 



aat act aat att gaa tta aca aaa ttg eta aaa tea ttt ttg ttt aat 3124 
Asn Thr Asn He Glu Leu Thr Lys Leu Leu Lys Ser Phe Leu Phe Asn 
780 785 790 



gca att att ttg ttt tgt tta gga ttt eta tat tat tat gec ata tat 3172 
Ala He He Leu Phe Cys Leu Gly Phe Leu Tyr Tyr Tyr Ala He Tyr 
795 800 805 810 



ttt gat gta gag aat gta agt ctt ttt gga aga aat tta att gga tea 3220 
Phe Asp Val Glu Asn Val Ser Leu Phe Gly Arg Asn Leu He Gly Ser 
815 820 825 



gat tgg ata aat ggg atg cat acg cag aga gca atg get ttc ttt gaa 



3268 



F-t=H 1 2 - 2 1 1 A 4 



1R 5 
: 46/ 



Asp Trp He Asn Gly Met His Thr Gin Arg Ala Met Ala Phe Phe Glu 
830 835 840 



tat tea aat ctt ata ata ccc tta act ate ata act aat ata tat ata 3316 
Tyr Ser Asn Leu He He Pro Leu Thr He He Thr Asn lie Tyr He 
845 850 855 

tat ata tat att aag caa aga tat age tea ggg atg atg ata etc ggt 3364 
Tyr He Tyr He Lys Gin Arg Tyr Ser Ser Gly Met Met He Leu Gly 
860 865 870 



get ctt etc tec act att ata eta ccc ate ggg tct gga tct aga get 3412 
Ala Leu Leu Ser Thr He He Leu Pro He Gly Ser Gly Ser Arg Ala 
875 880 - 885 890 



ggt att ata gtt gtg eta eta cag gtt ata att tta ttg ttg aat aca 3460 
Gly lie He Val Val Leu Leu Gin Val He He Leu Leu Leu Asn Thr 
895 900 905 



att gta ata aaa aga caa acg ata aga ttt ttc ctg tat tta gtt ccg 3508 

He Val He Lys Arg Gin Thr He Arg Phe Phe Leu Tyr Leu Val Pro 
910 915 920 

ata eta ata tta eta tta gtg ata tta cgt ttt gat aat ttg gtg age 3556 

He Leu He Leu Leu Leu Val He Leu Arg Phe Asp Asn Leu Val Ser 
925 930 935 

ata tat aat aga ata ate aat ttg egg teg gga agt agt gaa tct aga 3604 

He Tyr Asn Arg He He Asn Leu Arg Ser Gly Ser Ser Glu Ser Arg 



tittm w-m 1 3^ 1 e ■ 5 a 
mmm^= h 1 2- 2 1 1 a 4 m 47/ 57 

940 945 950 

ttt tct ttg tac aag gat acc gta cac tea gta att act gac tea eta 3652 
Phe Ser Leu Tyr Lys Asp Thr Val His Ser Val He Thr Asp Ser Leu 
955 960 965 970 



ttt ctg gga aaa ggt gta aaa gaa ttg tgg tta aat agt gat tta cca 
Phe Leu Gly Lys Gly Val Lys Glu Leu Trp Leu Asn Ser Asp Leu Pro 
975 980 985 



3700 



eta gga teg cat teg acc tac ata ggt tat ttc tat aaa act ggc eta 
Leu Gly Ser His Ser Thr Tyr He Gly Tyr Phe Tyr Lys Thr Gly Leu 
990 995 1000 



3748 



ttt gga eta ata aat gtg att tta ggt ttg ttt eta att ctt att age 3796 
Phe Gly Leu He Asn Val He Leu Gly Leu Phe Leu He Leu He Ser 
1005 1010 1015 



att ate aag gaa get aaa aag tea gat ttc tat tat gag ata gta ggg 3844 

He He Lys Glu Ala Lys Lys Ser Asp Phe Tyr Tyr Glu lie Val Gly 
1020 1025 1030 

tct gtc ata etc eta ttt tea ttt ttt gca ctt gaa gat att gat ggc 3892 

Ser Val He Leu Leu Phe Ser Phe Phe Ala Leu Glu Asp He Asp Gly 
1035 1040 1045 1050 



gec aat tgg etc att att ttt gtc ttt aca gtg ttg gga att tta gaa 
Ala Asn Trp Leu lie He Phe Val Phe Thr Val Leu Gly lie Leu Glu 
1055 1060 1065 



3940 



mm^=H 1 2 - 2 1 1 A 4 



tsrns 1 3^ i^ 5 □ 

M : 48/ 57 



aat aag gat ttc tat agt caa ctt aaa agg tgg gaa agt ta atg gaa 3987 
Asn Lys Asp Phe Tyr Ser Gin Leu Lys Arg Trp Glu Ser Met Glu 
1070 1075 1080 



aaa caa ata ctt gtt tct ate gtt ata cct ata tac aac teg gaa gca 4035 

Lys Gin He Leu Val Ser He Val He Pro He Tyr Asn Ser Glu Ala 
1085 1090 1095 

tat ctt aaa gaa tgc gtg caa tec gtc eta caa cag act cat tea ttg 4083 

Tyr Leu Lys Glu Cys Val Gin Ser Val Leu Gin Gin Thr His Ser Leu 
1100 1105 1110 

ata gaa gtt ata ctg att aat gat gga tec act gat aat agt gga gaa 4131 

He Glu Val He Leu He Asn Asp Gly Ser Thr Asp Asn Ser Gly Glu 
1115 1120 1125 



att tgt gat aat tta tct caa aaa gac gat cgc ata ctt gta ttt cat 4179 
He Cys Asp Asn Leu Ser Gin Lys Asp Asp Arg He Leu Val Phe His 
1130 1135 1140 1145 



aaa aaa aat gga ggg gta tct teg gca agg aac eta ggt ctt gat aaa 4227 
Lys Lys Asn Gly Gly Val Ser Ser Ala Arg Asn Leu Gly Leu Asp Lys 
1150 1155 1160 



tec aca ggc gaa ttc ata acg ttt gta gat agt gat gat ttt gta gca 
Ser Thr Gly Glu Phe He Thr Phe Val Asp Ser Asp Asp Phe Val Ala 
1165 1170 1175 



4275 



F-? = H 12-21 1 A 4 



ffittiB ¥ffi 1 3^ 1 J§ 5 0 
M: 49/ 57 



ccg aat ata att gaa ata atg tta aaa aat tta ate act gag gat get 
Pro Asn He He Glu He Met Leu Lys Asn Leu lie Thr Glu Asp Ala 
1180 1185 1190 



4323 



gat ata gca gaa gta gat ttt gat att teg aat gag aga gat tat aga 4371 
Asp He Ala Glu Val Asp Phe Asp He Ser Asn Glu Arg Asp Tyr Arg 
1195 1200 1205 



aag aaa aaa aga cga aac ttt tat aag gtc ttt aaa aac aat aat tct 4419 
Lys Lys Lys Arg Arg Asn Phe Tyr Lys Val Phe Lys Asn Asn Asn Ser 
1210 1215 1220 1225 

tta aaa gaa ttt tta tea ggt aat aga gtg gaa aat att gtt tgt aca 4467 & 
Leu Lys Glu Phe Leu Ser Gly Asn Arg Val Glu Asn He Val Cys Thr 
1230 1235 1240 

aaa tta tat aaa aaa agt ata att ggt aac ttg agg ttt gat gag aat 4515 
Lys Leu Tyr Lys Lys Ser He He Gly Asn Leu Arg Phe Asp Glu Asn 
1245 1250 1255 

tta aaa att ggt gag gat tta ctt ttt aat tgt aaa att tta tgt caa 4563 
Leu Lys He Gly Glu Asp Leu Leu Phe Asn Cys Lys He Leu Cys Gin 
1260 1265 1270 

gag cac tgc ata gtc gta gat acg act tct tec ttg tac acc tat cgc 4611 
Glu His Cys He Val Val Asp Thr Thr Ser Ser Leu Tyr Thr Tyr Arg 
1275 1280 1285 



ate gta aag act tct gca atg aat cag gag ttc aac gaa aat tea tta 



4659 



F-f = H 1 2 - 2 1 1 A 4 



H: 50/ 57 



He Val Lys Thr Ser Ala Met Asn Gin Glu Phe Asn Glu Asn Ser Leu 
1290 1295 1300 1305 



gat ttt ata aca att ttt aat gaa ata age agt att gtt cct gca aaa 
Asp Phe He Thr He Phe Asn Glu He Ser Ser He Val Pro Ala Lys 
1310 1315 1320 



4707 



tta get aat tat gtt gaa gcg aaa ttt tta aga gaa aag gta aag tgt 
Leu Ala Asn Tyr Val Glu Ala Lys Phe Leu Arg Glu Lys Val Lys Cys 
1325 1330 1335 



4755 



etc cga aaa atg ttt gaa tta ggt agt aat att gac agt aaa ate aaa 4803 
Leu Arg Lys Met Phe Glu Leu Gly Ser Asn He Asp Ser Lys He Lys * 
1340 1345 1350 



tta caa cga gag att ttt ttc aaa gat gtt aaa tta tac cct ttc tat 
Leu Gin Arg Glu He Phe Phe Lys Asp Val Lys Leu Tyr Pro Phe Tyr 
1355 1360 1365 



4851 



aaa gcg gtt aag tac tta tea tta aag gga tta ttg agt att tac tta 4899 
Lys Ala Val Lys Tyr Leu Ser Leu Lys Gly Leu Leu Ser He Tyr Leu 
1370 1375 1380 1385 

atg aaa tgt tea ccc ate ttg tat ata aaa tta tat gac agg ttt caa 4947 
Met Lys Cys Ser Pro He Leu Tyr He Lys Leu Tyr Asp Arg Phe Gin 
1390 1395 1400 

aaa cag taagtaatca aaaattaaat taactcaatt accttttaaa ttataggagt 5003 
Lys Gin 



F-*f = H 1 2 - 2 1 1 A 4 



tima ¥mi 3^ ir 5 a 

M: 51/ 57 



tgaaa atg aat tat agt ate att atg teg gta tat aat gag cct tta aat 5053 
Met Asn Tyr Ser He He Met Ser Val Tyr Asn Glu Pro Leu Asn 
1405 1410 1415 

tat gtg aga gat tea gta gaa tct ata tta aat caa acg ctt act gat 5101 
Tyr Val Arg Asp Ser Val Glu Ser He Leu Asn Gin Thr Leu Thr Asp 
1420 . 1425 1430 

ttt gag ttc ata att gtc att gat aat cca agt aga ggt gat tta aag 5149 
Phe Glu Phe He He Val He Asp Asn Pro Ser Arg Gly Asp Leu Lys 
1435 1440 1445 1450 

caa ttc tta aca gaa tat tea gtt gta gat aat aga ata aaa ate ttg 5197 
Gin Phe Leu Thr Glu Tyr Ser Val Val Asp Asn Arg He Lys He Leu 
1455 1460 1465 

ctt aat gaa gaa aat att ggt tta gca tea agt ttg aac aaa gcg gtg 5245 
Leu Asn Glu Glu Asn He Gly Leu Ala Ser Ser Leu Asn Lys Ala Val 
1470 1475 .1480 



aaa att tct aag gga gaa tat att ttt aga atg gat get gat gat att 5293 
Lys He Ser Lys Gly Glu Tyr He Phe Arg Met Asp Ala Asp Asp He 
1485 1490 1495 



tea tat cca agt aga ttt gat aag caa att cgt ttt atg gag gaa aat 
Ser Tyr Pro Ser Arg Phe Asp Lys Gin He Arg Phe Met Glu Glu Asn 
1500 1505 1510 



5341 



F-Sf = H 1 2 - 2 1 1 A4 



ISttiH ¥/3&l 1 H 5 0 
K: 52/ 57 



tea ttg gat ttc tea gca act eta ata gaa ttg ata gac caa aaa gga 
Ser Leu Asp Phe Ser Ala Thr Leu lie Glu Leu He Asp Gin Lys Gly 
1515 1520 1525 1530 



5389 



aat tta gta tat aaa caa cga gaa agt aat aaa ata tac tta act aat 
Asn Leu Val Tyr Lys Gin Arg Glu Ser Asn Lys He Tyr Leu Thr Asn 
. 1535 1540 1545 



5437 



gat ata egg aag atg tta ttg aat aga tct ata ctt gec cac cca acg 
Asp He Arg Lys Met Leu Leu Asn Arg Ser He Leu Ala His Pro Thr 
1550 1555 1560 



5485 



tgg tgc gta aaa aag aaa gtt ttc gat aag tta atg gga tat aga gat 
Trp Cys Val Lys Lys Lys Val Phe Asp Lys Leu Met Gly Tyr Arg Asp 
1565 1570 1575 



5533 



tta gta cct gtt gaa gat tat gat ttt gca ata aga gga get ctg get 
Leu Val Pro Val Glu Asp Tyr Asp Phe Ala He Arg Gly Ala Leu Ala 
1580 1585 1590 



5581 



gat ttc aaa ate ggc tta etc aat aaa gta ctt tta cag tat aga tta 
Asp Phe Lys He Gly Leu Leu Asn Lys Val Leu Leu Gin Tyr Arg Leu 
1595 1600 1605 1610 



5629 



aac gag aat gga ata tea caa acc aat aag ttt aag caa tat att tac 
Asn Glu Asn Gly He Ser Gin Thr Asn Lys Phe Lys Gin Tyr He Tyr 
1615 1620 1625 



5677 



tea get att tta caa gat ttt tat aaa gaa aaa tct tat att gat ate 



5725 



= -? = H 1 2- 2 1 1 A4 



SttlB 1 3^ 



1M 5 B 
: 53/ 57 



Ser Ala He Leu Gin Asp Phe Tyr Lys Glu Lys Ser Tyr He Asp He 
1630 1635 1640 



aca aaa att act aat tac ttt caa gag tat gtg ata aag aaa cgc tat 
Thr Lys He Thr Asn Tyr Phe Gin Glu Tyr Val He Lys Lys Arg Tyr 
1645 1650 1655 



5773 



act cag caa gag etc tct aaa tat ttt gag eta aaa tct ace cct agt 5821 

Thr Gin Gin Glu Leu Ser Lys Tyr Phe Glu Leu Lys Ser Thr Pro Ser 
1660 1665 1670 

att act att aga aaa eta tat att tgt tta tat tta tac ttt aag tct 5869 

He Thr He Arg Lys Leu' Tyr lie Cys Leu Tyr Leu Tyr Phe Lys Ser 
1675 1680 1685 1690 



ccc ttg gtt agg agg tta tta ata aat gat att aat att tta gta ctg 5917 
Pro Leu Val Arg Arg Leu Leu He Asn Asp He Asn He Leu Val Leu 
1695 1700 1705 

aaa ttg ttt gga gga gag aaa caa agt gac taatagaaaa atttatgtat 5967 
Lys Leu Phe Gly Gly Glu Lys Gin Ser Asp 
1710 1715 

gtcatactct ttatcattta ttgatttgtt tatataaaga agagatatat tcaaatttag 6027 

aaattattct ctcttcttct attcctgatg ttgataattt agagaaaaaa ttaaaatcaa 6087 



aaacaataaa tatacatatt ttagaagaat ctagtggtga aagtgaagaa ttattatcag 6147 



titUB ¥fiU 3# 1M 5B 
H31#-Sf = H 1 2- 2 1 1 A 4 H; 54/ 57 

tacttaaaga tgctggtcta agttatagta agtttgatag taattgtttt atttttaatg 6207 
atgcaacgcc tattgggagg acactaataa agcatggtat ttattataat ctaattgaag 6267 
atggtttaaa ttgttttact tactctatat ttagtcaaaa actttggaag tattatgtaa 6327 
aaaaatatat tcttcacaaa attcagccac atggattttc acgatattgt ttagggattg 6387 
aagttaattc attagttaat ttgccaaagg atccgcgtta taaaaaattt attgaagtcc 6447 
ctaggaaaga actttttgac aatgtaacag aatatcaaaa agaaatggca ataaatcttt 6507 
ttggagcagt aagagttagt attaaatcac cttcagtact agtattaacg cagcctctat 6567 
ctatagataa agagtttatg agttataaca ataagataga aacgtccgaa gaacaattta 6627 
atttttataa atcaatagtc aatgaatata taaataaagg gtacaatgtt tatttaaaag 6687 
ttcatcctag agatgtagta gattattcca aattgccggt agagctatta ccatcaaatg 6747 
ttcctatgga aattatagag ttgatgttaa caggtcggtt cgaatgtggg ataacacatt 6807 
cgtccactgc gctggatttt ttaacttgtg ttgataaaaa aataacttta gtagatct 6865 



<210> 4 

<211> 27 

<212> DNA 

<213> Artificial Sequence 



Ilf-? = H 12-21 1A4 g : 55/ 57 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 4 

gggggatcca atggtattga aatacag 27 



<210> 5 
<211> 29 

<212> DNA • . 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 5 

aatctgcaga cttagctcct gtcccgagt 29 



<210> 6 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 



<400> 6 

ccaagcggcc gctatagtca acttaaaagg tgg 



33 



mm^=u 1 2 - 2 1 1 A 4 



ilfcHB 



« 1 3^ 



1M 5B 
: 56/ 57 



<210> 7 
<21i> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 7 

cggctcgagt cccaataggc gttgcatc 28 

<210> 8 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 

ccggaattcg aaaaggtaaa gtgtctccga aa 32 

[mi] XhU7b^vjiX - 7-fi"7t?-*- Type IaflUfc .fcVType 
[02] 0 1 , 3-#7* h-xm&&m&m77X $ KpBBPIJ&iVpBB 



ilfctiB ¥gU 3^ IE 5 
F-?=H 1 2 - 2 1 1 A 4 M: 57/ 



P lac : lacT'P^E — # — 

cpsi : /? i , 3-N-r-fe^-;i/^;i/3t>-^ >mmmm&? 

cpsJ : /? 1 , 3- #7* h-^te^HS^Jg^^ 



^ = H 1 2 - 2 1 1 A 4 



M: 1/2 
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=^=H 1 2 - 2 1 1 A 4 



^tHB ¥gU 3^ in 5 B 
M: 2/2 



[12] 
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